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Description 

BACKGROUND OF THE INVENTION 

5 [0001] Adeno-associated virus (AAV), a member of the Parvovirus family, is a small nonenveloped, icosahedral virus 
with single-stranded linear DNA genomes of 4.7 kilobases (kb) to 6 kb. AAV is assigned to the genus, Dependovirus, 
because the virus was discovered as a contaminant in purified adenovirus stocks. AAV's life cycle includes a latent 
phase at which AAV genomes, after infection, are site specifically integrated into host chromosomes and an infectious 
phase in which, following either adenovirus or herpes simplex virus infection, the integrated genomes are subsequently 

10 rescued, replicated, and packaged into infectious viruses, The properties of non-pathogenicity, broad host range of 
infectivity, including non-dividing cells, and potential site-specific chromosomal integration make AAV an attractive tool 
for gene transfer. 

[0002] Recent studies suggest that AAV vectors may be the preferred vehicle for gene therapy. To date, there have 
been 6 different serotypes of AAVs isolated from human or non-human primates (NHP) and well characterized. Among 
15 them, human serotype 2 is the first AAV that was developed as a gene transfer vector; it has been widely used for 
efficient gene transfer experiments in different target tissues and animal models. Clinical trials of the experimental 
application of AAV2 based vectors to some human disease models are in progress, and include such diseases as 
cystic fibrosis and hemophilia B. 

[0003] What are desirable are AAV-based constructs for gene delivery. 

20 

SUMMARY OF THE INVENTION 

[0004] In one aspect, the invention provides a novel method of detecting and identifying AAV sequences from cellular 
DNAs of various human and non-human primate (NHP) tissues using bioinformatics analysis, PCR based gene am- 
25 plification and cloning technology, based on the nature of latency and integration of AAVs in the absence of helper 
virus co-infection. 

[0005] In another aspect, the invention provides method of isolating novel AAV sequences detected using the above 
described method of the invention. The invention further comprises methods of generating vectors based upon these 
novel AAV serotypes, for serology and gene transfer studies solely based on availability of capsid gene sequences 
30 and structure of rep/cap gene junctions. 

[0006] In still another aspect, the invention provides a novel method for performing studies of serology, epidemiology, 
biodistribution and mode of transmission, using reagents according to the invention, which include generic sets of 
primers/probes and quantitative real time PCR. 

[0007] In yet another aspect, the invention provides a method of isolating complete and infectious genomes of novel 
35 AAV serotypes from cellular DNA of different origins using RACE and other molecular techniques. 

[0008] In a further aspect, the invention provides a method of rescuing novel serotypes of AAV genomes from human 
and NHP cell lines using adenovirus helpers of different origins. 

[0009] In still a further aspect, the invention provides novel AAV serotypes, vectors containing same, and methods 
of using same. 

40 [0010] These and other aspects of the invention will be readily apparent from the following detailed description of 
the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 
45 [0011] 

Figs. 1A through 1AAAR provide an alignment of the nucleic acid sequences encoding at least the cap proteins 
for the AAV serotypes. The full-length sequences including the ITRs, the rep region, and the capsid region are 
provided for novel AAV serotype 7 [SEQ ID NO:1], and for previously published AAV1 [SEQ IN NO:6], AAV2 [SEQ 

50 ID NO:7]; and AAVS [SEQ ID NO:8] Novel AAV serotypes AAVS [SEQ ID NO:4] and AAV9 [SEQ ID NO:5] are the 

subject of co-filed applications. The other novel clones of the invention provided in this alignment include: 42-2 
[SEQ ID NO:9], 42-8 [SEQ ID NO:27], 42-15 [SEQ ID NO:28], 42-5b [SEQ ID NO: 29], 42-1 b [SEQ ID NO:30]; 
42-13 [SEQ ID NO: 31], 42-3a [SEQ ID NO: 32], 42-4 [SEQ ID NO:33], 42-5a [SEQ ID NO: 34], 42-10 [SEQ ID 
NO:35], 42-3b [SEQ ID NO: 36], 42-1 1 [SEQ ID NO: 37], 42-6b [SEQ ID NO:38], 43-1 [SEQ ID NO: 39], 43-5 [SEQ 

55 ID NO: 40], 43-12 [SEQ ID NO:41], 43-20 [SEQ ID NO:42], 43-21 [SEQ ID NO: 43], 43-23 [SEQ ID NO:44], 43-25 

[SEQ ID NO: 45], 44.1 [SEQ ID NO:47], 44.5 [SEQ ID NO:47], 223.10 [SEQ ID NO:48], 223.2 [SEQ ID NO:49], 
223.4 [SEQ ID NO:50], 223.5 [SEQ ID NO: 51], 223.6 [SEQ ID NO: 52], 223.7 [SEQ ID NO: 53], A3.4 [SEQ ID 
NO: 54], A3.5 [SEQ ID NO:55], A3.7 [SEQ ID NO: 56], A3.3 [SEQ ID NO:57], 42.12 [SEQ ID NO: 58], 44.2 [SEQ 
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ID NO: 59]. The nucleotide sequences of the signature regions of AAV10 [SEQ ID NO: 117], AAV11 [SEQ ID NO: 
118] and AAV12 [SEQ ID NO: 11 9] are provided in this figure. Critical landmarks in the structures of AAV genomes 
are shown. Gaps are dennonstrated by dots. The 3' ITR of AAV1 [SEQ ID NO:6] is shown in the same configuration 
as In the published sequences. TRS represents terminal resolution site. Notice that AAV7 Is the only AAV reported 

5 that uses GTG as the initiation codon for VPS. 

Figs. 2A through 2F are an alignment of the amino acid sequences of the proteins of the vpl capsid proteins of 
previously published AAV serotypes 1 [SEQ ID NO:64], AAV2 [SEQ ID NO:70], AAV3 [SEQ ID NO: 71], AAV4 
[SEQ ID NO:63], AAV5 [SEQ ID NO:114], and AAV6 [SEQ ID NO:65] and novel AAV sequences of the Invention, 
including: 01 [SEQ ID NO:60], 02 [SEQ ID NO:61 ], 05 [SEQ ID NO:62], A3-3 [SEQ ID NO:66], A3-7 [SEQ ID NO: 

10 67], A3-4 [SEQ ID NO:68], A3-5 [SEQ ID NO: 69], 3.3b [SEQ ID NO: 62], 223.4 [SEQ ID NO: 73], 223-5 [SEQ ID 

NO:74], 223-10 [SEQ ID NO:75], 223-2 [SEQ ID NO:76], 223-7 [SEQ ID NO: 77], 223-6 [SEQ ID NO: 78], 44-1 
[SEQ ID NO: 79], 44-5 [SEQ ID NO:80], 44-2 [SEQ ID NO:81], 42-15 [SEQ ID NO: 84], 42-8 [SEQ ID NO: 85], 
42-13 [SEQ ID NO:86], 42-3A [SEQ ID NO:87], 42-4 [SEQ ID NO:88], 42-5A [SEQ ID NO:89], 42-1 B [SEQ ID NO: 
90], 42-5B [SEQ ID N0:91], 43-1 [SEQ ID NO: 92], 43-12 [SEQ ID NO: 93], 43-5 [SEQ ID NO:94], 43-21 [SEQ ID 

15 NO:96], 43-25 [SEQ ID NO: 97], 43-20 [SEQ ID NO:99], 24.1 [SEQ ID NO: 1 01 ], 42.2 [SEQ ID NO:102], 7,2 [SEQ 

ID NO: 1 03], 27.3 [SEQ ID NO: 1 04], 1 6.3 [SEQ ID NO: 1 05], 42.1 0 [SEQ ID NO: 1 06], 42-38 [SEQ ID NO: 1 07], 
42-11 [SEQ ID NO: 108], F1 [SEQ ID NO: 109], F5 [SEQ ID NO: 110], F3 [SEQ ID NO:111], 42-6B [SEQ ID NO: 
112], 42-12 [SEQ ID NO: 113]. Novel serotypes AAV8 [SEQ ID NO:95] and AAV9 [SEQ ID NO: 100] are the subject 
of co-filed patent applications. 

20 Figs. 3A through 30 provide the amino acid sequences of the AAV7 rep proteins [SEQ ID NO:3]. 

DETAILED DESCRIPTION OF THE INVENTION 

[0012] In the present invention, the inventors have found a method which takes advantage of the ability of adeno- 

25 associated virus (AAV) to penetrate the nucleus, and, In the absence of a helper virus co-lnfectlon, to integrate Into 
cellular DNA and establish a latent infection. This method utilizes a polymerase chain reaction (PCR)-based strategy 
for detection, identification and/or isolation of sequences of AAVs from DNAs from tissues of human and non-human 
primate origin as well as from other sources. Advantageously, this method is also suitable for detection. Identification 
and/or isolation of other integrated viral and non-viral sequences, as described below. 

30 [0013] The invention further provides nucleic acid sequences identified according to the methods of the invention. 
One such adeno-associated virus is of a novel serotype, termed herein serotype 7 (AAV7). Other novel adeno-asso- 
ciated virus serotypes provided herein include AAV10, AAV11 , and AAV12. Still other novel AAV serotypes identified 
accordingto the methods of the Invention are provided in the present specification. See, Figures and Sequence Listing, 
which is incorporated by reference. 

35 [0014] Also provided are fragments of these AAV sequences. Among particularly desirable AAV fragments are the 
cap proteins, including the vpl , vp2, vp3, the hypervariable regions, the rep proteins, including rep 78, rep 68, rep 52, 
and rep 40, and the sequences encoding these proteins. Each of these fragments may be readily utilized in a variety 
of vector systems and host cells. Such fragments may be used alone. In combination with other AAV sequences or 
fragments, or in combination with elements from other AAV or non-AAV viral sequences. In one particularly desirable 

40 embodiment, a vector contains the AAV cap and/or rep sequences of the invention. 

[0015] As described herein, alignments are performed using any of a variety of publicly or commercially available 
Multiple Sequence Alignment Programs, such as "Clustal W", accessible through Web Servers on the Internet. Alter- 
natively, Vector NTI utilities are also used. There are also a number of algorithms known In the art which can be used 
to measure nucleotide sequence identity, including those contained in the programs described above. As another 

45 example, polynucleotide sequences can be compared using Fasta, a program in GOG Version 6.1 . Fasta provides 
alignments and percent sequence identity of the regions of the best overlap between the query and search sequences. 
For instance, percent sequence identity between nucleic acid sequences can be determined using Fasta with its default 
parameters (a word size of 6 and the NOPAM factor for the scoring matrix) as provided in GOG Version 6.1 , herein 
incorporated by reference. Similar programs are available for amino acid sequences, e.g., the "Clustal X" program. 

50 Generally, any of these programs are used at default settings, although one of skill in the art can alter these settings 
as needed. Alternatively, one of skill in the art can utilize another algorithm or computer program which provides at 
least the level of identity or alignment as that provided by the referenced algorithms and programs. 
[0016] The term "substantial homology" or "substantial similarity," when referring to a nucleic acid, orfragment there- 
of , Indicates that, when optimally aligned with appropriate nucleotide Insertions or deletions with another nucleic acid 

55 (or Its complementary strand), there Is nucleotide sequence identity in at least about 95 to 99% of the aligned sequenc- 
es. Preferably, the homology Is overfull-length sequence, or an open reading frame thereof, or another suitable frag- 
ment which is at least 15 nucleotides in length. Examples of suitable fragments are described herein. 
[0017] The term "substantial homology" or "substantial similarity," when referring to amino acids orfragments thereof. 



3 



EP1 310 571 A2 



indicates tliat, when optimally aligned with appropriate amino acid insertions or deletions with another amino acid, 
there is amino acid sequence identity in at least about 95 to 99% of the aligned sequences. Preferably, the homology 
is over full-length sequence, or a protein thereof, e.g., a cap protein, a rep protein, or a fragment thereof which is at 
least 8 amino acids, or more desirably, at least 15 amino acids in length. Examples of suitable fragments are described 

5 herein. 

[0018] By the term "highly conserved" is meant at least 80% identity, preferably at least 90% identity, and more 
preferably, over 97% identity. Identity is readily determined by one of sl<ill in the art by resort to algorithms and computer 
programs known by those of skill in the art. 

[0019] The tenn "percent sequence identity" or "identical" in the context of nucleic acid sequences refers to the 
10 residues in the two sequences which are the same when aligned for maximum correspondence. The length of sequence 
identity comparison may be overthe full-length of the genome, the full-length of a gene coding sequence, or a fragment 
of at least about 500 to 5000 nucleotides, is desired. However, identity among smallerfragments, e.g. of at least about 
nine nucleotides, usually at least about 20 to 24 nucleotides, at least about 28 to 32 nucleotides, at least about 36 or 
more nucleotides, may also be desired. Similarly, "percent sequence identity" may be readily determined for amino 
f5 acid sequences, over the full-length of a protein, or a fragment thereof. Suitably, a fragment is at least about 8 amino 
acids in length, and may be up to about 700 amino acids. Examples of suitable fragments are described herein. 
[0020] The AAV sequences and fragments thereof are useful in production of rAAV, and are also useful as antlsense 
delivery vectors, gene therapy vectors, or vaccine vectors. The invention further provides nucleic acid molecules, gene 
delivery vectors, and host cells which contain the AAV sequences of the invention. 
20 [0021] As described herein, the vectors of the invention containing the AAV capsid proteins of the invention are 
particularly well suited for use in applications in which the neutralizing antibodies diminish the effectiveness of other 
AAV serotype based vectors, as well as other viral vectors. The rAAV vectors of the invention are particularly advan- 
tageous in rAAV readmlnistration and repeat gene therapy. 

[0022] These and other embodiments and advantages of the Invention are described In more detail below. As used 
25 throughout this specification and the claims, the tenns "comprising" and "including" and their variants are inclusive of 
other components, elements, integers, steps and the like. Conversely, the term "consisting" and its variants is exclusive 
of other components, elements, integers, steps and the like. 

I. Methods of the Invention 

30 

A. Detection of Sequences Via Molecular Cloning 

[0023] In one aspect, the invention provides a method of detecting and/or identifying target nucleic acid sequences 
in a sample. This method is particularly well suited for detection of viral sequences which are integrated Into the chro- 

35 mosome of a cell, e.g., adeno-assoclated viruses (AAV) and retroviruses, among others. The specification makes 
reference to AAV, which is exemplified herein. However, based on this information, one of skill in the art may readily 
perform the methods of the invention on retroviruses [e.g., feline leukemia virus (FeLV), HTLVI and HTLVII], and len- 
tivirinae [e.g., human immunodeficiency virus (HIV), simian immunodeficiency virus (SIV), feline immunodeficiency 
virus (FIV), equine infectious anemia virus, and spumavirinal)], among others. Further, the method of the invention 

40 may also be used for detection of other viral and non-viral sequences, whether integrated or non-integrated into the 
genome of the host cell. 

[0024] As used herein, a sample is any source containing nucleic acids, e.g., tissue, tissue culture, cells, cell culture, 
and biological fluids including, without limitation, urine and blood. These nucleic acid sequences may be DNA or RNA 
from plasmids, natural DNA or RNA from any source, including bacteria, yeast, viruses, and higher organisms such 

45 as plants or animals. DNA or RNA is extracted from the sample by a variety of techniques known to those of skill in 
the art, such as those described by Sambrook, Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor 
Laboratory). The origin of the sample and the method by which the nucleic acids are obtained for application of the 
method of the invention is not a limitation of the present invention. Optionally, the method of the invention can be 
performed directly on the source of DNA, or on nucleic acids obtained (e.g., extracted) from a source. 

50 [0025] The method of the invention involves subjecting a sample containing DNA to amplification via polymerase 
chain reaction (PCR) using a first set of primers specific for a first region of double-stranded nucleic acid sequences, 
thereby obtaining amplified sequences. 

[0026] As used herein, each of the "regions" is predetermined based upon the alignment of the nucleic acid sequenc- 
es of at least two serotypes (e.g., AAV) or strains (e.g., lentiviruses), and wherein each of said regions is composed 
55 of sequences having a 5' end which is highly conserved, a middle which is preferably, but necessarily, variable, and a 
3' end which is highly conserved, each of these being conserved or variable relative to the sequences of the at least 
two aligned AAV serotypes. Preferably, the 5' and/or 3' end is highly conserved over at least about 9, and more pref- 
erably, at least 18 base pairs (bp). However, one or both of the sequences at the 5' or 3' end may be conserved over 
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more than 1 8 bp, more than 25 bp, more than 30 bp, or more than 50 bp at the 5' end. With respect to the variable 
region, there is no requirement for conserved sequences, these sequences may be relatively conserved, or may have 
less than 90, 80, or 70% identity among the aligned serotypes or strains. 

[0027] Each of the regions may span about 100 bp to about 10 kilobase pairs in length. However, it is particularly 
5 desirable that one of the regions is a "signature region", i.e., a region which is sufficiently unique to positively identify 
the amplified sequence as being from the target source. For example, in one embodiment, the first region is about 250 
bp in length, and is sufficiently unique among known AAV sequences, that it positively identifies the amplified region 
as being of AAV origin. Further, the variable sequences within this region are sufficiently unique that can be used to 
identify the serotype from which the amplified sequences originate. Once amplified (and thereby detected), the se- 
10 quences can be identified by performing conventional restriction digestion and comparison to restriction digestion pat- 
terns for this region in any of AAV1, AAV2, AAV3, AAV4, AAV5, or AAV6, or that of AAV7, AAV10, AAV11, AAV12, or 
any of the other novel serotypes identified by the invention, which is predetermined and provided by the present in- 
vention. 

[0028] Given the guidance provided herein, one of skill in the art can readily identify such regions among other 
f5 integrated viruses to permit ready detection and identification of these sequences. Thereafter, an optimal set of generic 
primers located within the highly conserved ends can be designed and tested for efficient amplification of the selected 
region from samples. This aspect of the invention is readily adapted to a diagnostic kit for detecting the presence of 
the target sequence (e.g., AAV) and for identifying the AAV serotype, using standards which include the restriction 
patterns for the AAV serotypes described herein or isolated using the techniques described herein. For example, quick 
20 identification or molecular serotyping of PGR products can be accomplished by digesting the PGR products and com- 
paring restriction patterns. 

[0029] Thus, in one embodiment, the "signature region" for AAV spans about bp 2800 to about 3200 of AAV 1 [SEQ 
ID NO:6], and corresponding base pairs in AAV 2, AAV3, AAV4, AAV5, and AAV6. More desirably, the region is about 
250 bp, located within bp 2886 to about 3143 bp of AAV 1 [SEQ ID NO:6], and corresponding base pairs in AAV 2 
25 [SEQ ID NO:7], AAV3 [SEQ ID N08], and other AAV serotypes. See, Fig. 1 . To permit rapid detection of AAV in the 

sample, primers which specifically amplify this signature region are utilized. However, the present invention is not 
limited to the exact sequences identified herein for the AAV signature region, as one of skill in the art may readily alter 
this region to encompass a shorter fragment, or a larger fragment of this signature region. 

[0030] The PGR primers are generated using techniques known to those of skill in the art. Each of the PGR primer 
30 sets is composed of a 5' primer and a 3' primer. See, e.g., Sambrook et al, cited herein. The term "primer" refers to an 
oligonucleotide which acts as a point of initiation of synthesis when placed under conditions in which synthesis of a 
primer extension product which is complementary to a nucleic acid strand is induced. The primer is preferably single 
stranded. However, if a double stranded primer is utilized, it is treated to separate its strands before being used to 
prepare extension products. The primers may be about 15 to 25 or more nucleotides, and preferably at least 18 nu- 
35 cleotides. However, for certain applications shorter nucleotides, e.g., 7 to 15 nucleotides are utilized. 

[0031] The primers are selected to be sufficiently complementary to the different strands of each specific sequence 
to be amplified to hybridize with their respective strands. Therefore, the primer sequence need not reflect the exact 
sequence of the region being amplified. For example, a non-complementary nucleotide fragment may be attached to 
the 5' end of the primer, with the remainder of the primer sequence being completely complementary to the strand. 
40 Alternatively non-complementary bases or longer sequences can be interspersed into the primer, provided that the 
primer sequence has sufficient complementarity with the sequence of the strand to be amplified to hybridize therewith 
and form a template for synthesis of the extension product of the other primer. 

[0032] The PGR primers for the signature region according to the invention are based upon the highly conserved 
sequences of two or more aligned sequences (e.g. , two or more AAV serotypes). The primers can accommodate less 

45 than exact identity among the two or more aligned AAV serotypes at the 5' end or in the middle. However, the sequences 
at the 3' end of the primers correspond to a region of two or more aligned AAV serotypes in which there is exact identity 
over at least five, preferably, over at least nine base pairs, and more preferably, over at least 18 base pairs at the 3' 
end of the primers. Thus, the 3' end of the primers is composed of sequences with 100% identity to the aligned se- 
quences over at least five nucleotides. However, one can optionally utilize one, two, or more degenerate nucleotides 

50 at the 3' end of the primer. 

[0033] For example, the primer set for the signature region of AAV was designed based upon a unique region within 
the AAV capsid, as follows. The 5' primer was based upon nt 2867-2891 of AAV2 [SEQ ID NO:7], 
5'-GGTAATTGGTGGGGAAATTGGGATT3'. See, Fig. 1 . The3' primer was designed based upon nt 3096-3122 of AAV2 
[SEQ ID N0:7], 5'-GACTGATGAACAAGAAGTGGGGATTG-3'. However, one of skill in the art may have readily de- 

55 signed the primer set based upon the corresponding regions of AAV1 , AAV3, AAV4, AAV5, AAV6, or based upon the 
infoHTiation provided herein, AAV7, AAV10, AAV11 , AAV12, or another novel AAV of the invention. In addition, still other 
primer sets can be readily designed to amplify this signature region, using techniques known to those of skill in the art. 
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B. Isolation of Target Sequences 

[0034] As described herein, the present invention provides a first primerset which specifically amplifies the signature 
region of the target sequence, e.g., an AAV serotype, in order to permit detection of the target. In a situation in which 
5 further sequences are desired, e.g., if a novel AAV serotype is identified, the signature region maybe extended. Thus, 
the invention may further utilize one or more additional primer sets. 

[0035] Suitably, these primer sets are designed to include either the 5' or 3' primer of the first primerset and a second 
primer unique to the primerset, such that the primerset amplifies a region 5' or 3' to the signature region which anneals 
to either the 5' end or the 3' end of the signature region. For example, a first primer set is composed of a 5' primer, P1 

10 and a 3' primer P2 to amplify the signature region. In order to extend the signature region on its 3' end, a second primer 
set is composed of primer P1 and a 3' primer P4, which amplifies the signature region and contiguous sequences 
downstream of the signature region. In order to extend the signature region on its 5' end, a third primerset is composed 
of a 5' primer, P5, and primer P2, such that the signature region and contiguous sequences upstream of the signature 
region are amplified. These extension steps are repeated (or performed at the same time), as needed or desired. 

15 Thereafter, the products results from these amplification steps are fused using conventional steps to produce an isolated 
sequence of the desired length. 

[0036] The second and third primer sets are designed, as with the primer set for the signature region, to amplify a 
region having highly conserved sequences among the aligned sequences. Reference herein to the term "second" or 
"third" primer set is for each of discussion only, and without regard to the order in which these primers are added to 

20 the reaction mixture, or used for amplification. The region amplified by the second primer set is selected so that upon 
amplification it anneals at its 5' end to the 3' end of the signature region. Similarly, the region amplified by the third 
primer set is selected so that upon amplification it anneals at its 3' end anneals to the 5' end of the signature region. 
Additional primer sets can be designed such that the regions which they amplify anneal to the either the 5' end or the 
3' end of the extension products formed by the second or third primer sets, or by subsequent primer sets. 

25 [0037] For example, where AAV is the target sequence, a first set of primers (P1 and P2) are used to amplify the 
signature region from the sample. In one desirable embodiment, this signature region is located within the AAV capsid. 
A second set of primers (P1 and P4) is used to extend the 3' end ofthe signature region to a location in the AAV 
sequence which is just before the AAV 3' ITR, i.e., providing an extension product containing the entire 3' end ofthe 
AAV capsid when using the signature region as an anchor. In one embodiment, the P4 primer corresponds to nt 4435 

30 to 4462 of AAV2 [SEQ ID NO:7], and corresponding sequences in the other AAV serotypes. This results in amplification 
of a region of about 1 .6 l<b, which contains the 0.25 l<b signature region. A third set of primers (P3 and P2) is used to 
extend the 5' end of signature region to a location in the AAV sequences which is in the 3' end of the rep genes, i.e., 
providing an extension product containing the entire 5' end of the AAV capsid when using the signature region as an 
anchor. In one embodiment, the P3 primer corresponds to nt 1 384 to 1 409 of AAV2 [SEQ ID NO:7], and corresponding 

35 sequences in the other AAV serotypes. This results in amplification of a region of about 1 .7 kb, which contains the 0.25 
l<b signature region. Optionally, a fourth set of primers are used to further extend the extension product containing the 
entire 5' end of the AAV capsid to also include the rep sequences. In one embodiment, the primer designated P5 
corresponds to nt 108 to 133 of AAV2 [SEQ ID NO:7], and corresponding sequences in the other AAV serotypes and 
is used in conjunction with the P2 primer. 

40 [0038] Following completion of the desired number of extension steps, the various extension products are fused, 
mal<ing use of the signature region as an anchor or marker, to construct an intact sequence. In the example provided 
herein, AAV sequences containing, at a minimum, an intact AAV cap gene are obtained. Larger sequences may be 
obtained, depending upon the number of extension steps performed. 

[0039] Suitably, the extension products are assembled into an intact AAV sequence using methods known to those 
45 of skill in the art. For example, the extension products may be digested with Dralll, which cleaves at the Dralll site 
located within the signature region, to provide restriction fragments which are re-ligated to provide products containing 
(at a minimum) an intact AAV cap gene. However, other suitable techniques for assembling the extension products 
into an intact sequence may be utilized. See, generally, Sambrook et al, cited herein. 

[0040] As an alternative to the multiple extension steps described above, another embodiment of the invention pro- 
50 vides for direct amplification of a 3.1 kb fragment which allows isolation of full-length cap sequences. To directly amplify 
a 3.1 kb full-length cap fragment from NHP tissue and blood DMAs, two other highly conserved regions were identified 
in AAV genomes for use in PGR amplification of large fragments. A primer within a conserved region located in the 
middle of the rep gene is utilized (AVI ns: 5' GCTGCGTCAACTGGACCAATGAGAAC 3', nt of SEQ ID NO:6) in com- 
bination with the 3' primer located in another conserved region downstream of the Cap gene (AV2cas: 5' CGCAGA- 
55 GACCAAAGTTCAACTGAAACGA 3', SEQ ID NO: 7) for amplification of AAV sequences including the full-length AAV 
cap. Typically, following amplification, the products are cloned and sequence analysis is performed with an accuracy 
of > 99.9%. Using this method, the inventors have isolated at least 50 capsid clones which have subsequently been 
characterized. Among them, 37 clones were derived from Rhesus macaque tissues (rh.1 - rh.37), 6 clones from cy- 
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nomologous macaques (cy.1 - cy.6), 2 clones from Baboons (bb.1 and bb.2) and 5 clones from Chimps (ch.1 - ch.5). 
These clones are identified elsewhere in the specification, together with the species of animal from which they were 
identified and the tissues in that animal these novel sequences have been located. 

5 C. Alternative method for isolating novel AAV 

[0041] In another aspect, the invention provides an alternative method for isolating novel AAV from a eel I. This method 
involves infecting the cell with a vector which provides helper functions to the AAV; isolating infectious clones containing 
AAV; sequencing the isolated AAV; and comparing the sequences of the isolated AAV to known AAV serotypes, whereby 

10 differences in the sequences of the isolated AAV and known AAV serotypes indicates the presence of a novel AAV. 
[0042] In one embodiment, the vector providing helper functions provides essential adenovirus functions, including, 
e.g., E1a, E1b, E2a, E40RF6. In one embodiment, the helper functions are provided by an adenovirus. The adenovirus 
may be a wild-type adenovirus, and may be of human or non-human origin, preferably non-human primate (NHP) 
origin. The DNA sequences of a number of adenovirus types are available from Genbank, including type Ad5 [Genbank 

15 Accession No. M73260]. The adenovirus sequences may be obtained from any known adenovirus serotype, such as 
serotypes 2, 3, 4, 7, 12 and 40, and further including any of the presently identified human types [see, e.g., Honwitz, 
cited above]. Similarly adenoviruses known to infect non-human animals (e.g., chimpanzees) may also be employed 
in the vector constructs of this invention, See, e.g., US Patent No. 6,083,716. In addition to wild-type adenoviruses, 
recombinant viruses or non-viral vectors (e.g., plasmids, episomes, etc.) carrying the necessary helper functions may 

20 be utilized. Such recombinant viruses are known in the art and may be prepared according to published techniques. 
See, e.g., US Patent No. 5,871 ,982 and US Patent 6,251 ,677, which describe a hybrid Ad/AAV virus. The selection of 
the adenovirus type is not anticipated to limit the following invention. A variety of adenovirus strains are available from 
the American Type Culture Collection, IN^anassas, Virginia, or available by request from a variety of commercial and 
institutional sources. Further, the sequences of many such strains are available from a variety of databases including, 

25 e.g., PubMed and GenBank. 

[0043] In another alternative, infectious AAV may be isolated using genome walking technology (Siebert etal., 1995, 
Nucleic Acid Research, 23:1087-1088, Friezner-Degen etal., 1986, J. Biol. Chem. 261:6972-6985, BD Biosciences 
Clontech, Palo Alto, CA). Genome walking is particularly well suited for identifying and isolating the sequences adjacent 
to the novel sequences identified according to the method of the invention. For example, this technique may be useful 

30 for isolating inverted terminal repeat (ITRs) of the novel AAV serotype, based upon the novel AAV capsid and/or rep 
sequences identified using the methods of the invention. This technique is also useful for isolating sequences adjacent 
to other AAV and non-AAV sequences identified and isolated according to the present invention . See, Examples 3 and 4. 
[0044] The methods of the invention may be readily used for a variety of epidemiology studies, studies of biodistri- 
bution, monitoring of gene therapy via AAV vectors and vector derived from other integrated viruses. Thus, the methods 

35 are well suited for use in pre-packaged kits for use by clinicians, researchers, and epidemiologists. 

II. Diagnostic Kit 

[0045] In another aspect, the invention provides a diagnostic kit for detecting the presence of a known or unknown 

40 adeno-associated virus (AAV) in a sample. Such a kit may contain a first set of 5' and 3' PCR primers specific for a 
signature region of the AAV nucleic acid sequence. Alternatively, or additionally, such a kit can contain a first set of 5' 
and 3' PCR primers specific for the 3.1 kb fragment which includes the full-length AAV capsid nucleic acid sequence 
identified herein (e.g., the AV1ns and AV2cas primers.) Optionally, a kit of the invention may further contain two or 
more additional sets of 5' and 3' primers, as described herein, and/or PCR probes. These primers and probes are used 

45 according to the present invention amplify signature regions of each AAV serotype, e.g., using quantitative PCR. 

[0046] The invention further provides a kit useful for identifying an AAV serotype detected according to the method 
of the invention and/or for distinguishing novel AAV from known AAV. Such a kit may further include one or more 
restriction enzymes, standards for AAV serotypes providing their "signature restriction enzyme digestions analyses", 
and/or other means for determining the serotype of the AAV detected. 

50 [0047] In addition, kits of the invention may include, instructions, a negative and/or positive control, containers, dilu- 
ents and buffers for the sample, indicator charts for signature comparisons, disposable gloves, decontamination in- 
structions, applicator sticks or containers, and sample preparator cups, as well as any desired reagents, including 
media, wash reagents and concentration reagents. Such reagents may be readily selected from among the reagents 
described herein, and from among conventional concentration reagents. In one desirable embodiment, the wash rea- 

55 gent is an isotonic saline solution which has been buffered to physiologic pH, such as phosphate buffered saline (PBS); 
the elution reagent is PBS containing 0.4 M NaCI, and the concentration reagents and devices. For example, one of 
skill in the art will recognize that reagents such as polyethylene glycol (PEG), or NH4SO4 may be useful, orthat devices 
such as filter devices. For example, a filter device with a 100 K membrane would concentrate rAAV. 
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[0048] The kits provided by the present invention are useful for performing the nnethods described herein, and for 
study of biodistribution, epidemiology, mode of transmission of novel AAV serotypes in human and NHPs. 
[0049] Thus, the methods and kits of the invention permit detection, identification, and isolation of target viral se- 
quences, particularly integrated viral sequences. The methods and kits are particularly well suited for use in detection, 

5 identification and isolation of AAV sequences, which may include novel AAV serotypes. 

[0050] In one notable example, the method of the invention facilitated analysis of cloned AAV sequences by the 
inventors, which revealed heterogeneity of proviral sequences between cloned fragments from different animals, all 
of which were distinct from the known six AAV serotypes, with the majority of the variation localized to hypervariable 
regions of thecapsid protein. Surprising divergence of AAV sequences was noted in clones isolated from single tissue 

10 sources, such as lymph node, from an individual rhesus monkey. This heterogeneity is best explained by apparent 
evolution of AAV sequence within individual animals due, in part, to extensive homologous recombination between a 
limited number of co-infecting parenteral viruses. These studies suggest sequence evolution of widely disseminated 
virus during the course of a natural AAV infection that presumably leads to the formation of swarms of quasispecies 
which differ from one another in the array of capsid hypervariable regions. This is the first example of rapid molecular 

15 evolution of a DNA virus in a way that formerly was thought to be restricted to RNA viruses. 

[0051] Sequences of several novel AAV serotypes identified by the method of the invention and characterization of 
these serotypes is provided. 

III. Novel AAV Serotypes 

20 

A. Nucleic Acid Sequences 

[0052] Nucleic acid sequences of novel AAV serotypes identified by the methods of the invention are provided. See, 
SEQ ID NO:1, 9 - 59, and 117 - 120, which are incorporated by reference herein. See also. Fig. 1 and the sequence 
25 listing. 

[0053] For novel serotype AAV7, the full-length sequences, including the AAV 5' ITRs, capsid, rep, and AAV 3' ITRs 

are provided in SEQ ID NO:1 . 

[0054] For other novel AAV serotypes of the invention, the approximately 3.1 kb fragment isolated according to the 
method of the invention is provided. This fragment contains sequences encoding full-length capsid protein and all or 

30 part of the sequences encoding the rep protein. These sequences include the clones identified below. 

[0055] For still other novel AAV serotypes, the signature region encoding the capsid protein is provided. For example, 
the AAV10 nucleic acid sequences of the invention include those illustrated in Fig. 1 [See, SEQ ID NO: 11 7, which 
spans 255 bases]. The AAV11 nucleic acid sequences of the invention include the DNA sequences illustrated in Fig. 
1 [See, SEQ ID NO:1 18 which spans 258 bases]. The AAV1 2 nucleic acid sequences of the invention include the DNA 

35 sequences illustrated in Fig. 1 [See, SEQ ID NO: 11 9, which consists of 255 bases ]. Using the methodology described 
above, further AAV10, AAV11 and AAV12 sequences can be readily identified and used for a variety of purposes, 
including those described for AAV7 and the other novel serotypes herein. 

[0056] Figure 1 provides the non-human primate (NHP) AAV nucleic acid sequences of the invention in an alignment 
with the previously published AAV serotypes, AAV 1 [SEQ ID NO:6], AAV2 [SEQ ID NO:7], and AAV3 [SEQ ID NO:8]. 
40 These novel NHP sequences include those provided in the following Table I, which are identified by clone number: 



45 



50 



55 
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Table 1 



AAV CfiM 


"NJlim Kf»r 


i3L/Ui L/C 








Snecies 


Tissue 


SEQ ID NO 
(DNA) 


Rh.l 


Clone 9 
(AAV9) 


Rhesus 


Heart 


5 


Rh.2 


Clone 43.1 


Rhesus 


MLN 


39 


Rh.3 


Clone 43.5 


Rhesus 


MLN 


40 


Rh.4 


Clone 43.12 


Rhesus 


MLN 


41 


Rh.5 


Clone 43.20 


Rhesus 


MLN 


42 


Rh.6 


Clone 43.21 


Rhesus 


MLN 


43 


Rh.7 


Clone 43.23 


Rhesus 


MLN 


44 
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Table 1 (cont'd) 



Rh.8 


Clone 43.25 


Rhesus 


MLN 


45 


Rh.9 


Clone 44.1 


Rhesus 


Liver 


46 


Rh.lO 


Clone 44.2 


Rhesus 


Liver 


59 


Rh.ll 


Clone 44.5 


Rhesus 


Liver 


47 


Rh.l2 


Clone 
42.1B 


Rhesus 


MLN 


30 


Rh.l3 


42.2 


Rhesus 


MLN 


9 


Rh.l4 


Clone 
42.3A 


Rhesus 


MLN 


32 


Rh.l5 


Clone 
42.33 


Rhesus 


MLN 


36 


Rh.16 


Clone 42.4 


Rhesus 


MLN 


33 


Rh.l7 


Clone 
42. 5 A 


Rhesus 


MLN 


34 


Rh.l8 


Clone 
42.5B 


Rhesus 


MLN 


29 


Rh.l9 


Clone 
42.6B 


Rhesus 


MLN 


38 


Rh.20 


Clone 42.8 


Rhesus 


MLN 


27 


Rh.21 


Clone 42.10 


Rhesus 


MLN 


35 


Rh.22 


Clone 42.1 1 


Rhesus 


MLN 


37 


Rh.23 


Clone 42.12 


Rhesus 


MLN 


58 


Rh.24 


Clone 42.13 


Rhesus 


MLN 


31 


Rh.25 


Clone 42.15 


Rhesus 


MLN 


28 


Rh.26 


Clone 223.2 


Rhesus 


Liver 


49 


Rh.27 


Clone 223.4 


Rhesus 


Liver 


50 


Rh.28 


Clone 223.5 


Rhesus 


Liver 


51 


Rh.29 


Clone 223.6 


Rhesus 


Liver 


52 


Rh.30 


Clone 223.7 


Rhesus 


Liver 


53 


Rh.31 


Clone 
223.10 


Rhesus 


Liver 


48 


Rh.32 


Clone CI 


Rhesus 


Spleen, Duo, 
Kid & Liver 


19 


Rh.33 


Clone C3 


Rhesus 




20 


Rh.34 


Clone C5 


Rhesus 




21 


Rh.35 


Clone Fl 


Rhesus 


Liver 


22 


Rh.36 


Clone F3 


Rhesus 




23 


Rii.37 


Clone F5 


Rhesus 




24 


Cy.l 


Clone 1 .3 


Cyno 


Blood 


14 


Cy.2 


Clone 
13.3B 


Cyno 


Blood 


15 


Cy.3 


Clone 24.1 


Cyno 


Blood 


16 


Cy.4 


Clone 27.3 


Cyno 


Blood 


17 


Cy.5 


Clone 7.2 


Cyno 


Blood 


18 


Cy.6 


Clone 16.3 


Cyno 


Blood 


10 
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Table 1 (cont'd) 



bb.l 


Clone 29.3 


Baboon 


Blood 


11 


bb.2 


Clone 29.5 


Baboon 


Blood 


13 


Ch.l 


Clone A3.3 


Chimp 


Blood 


57 


Ch.2 


Clone A3.4 


Chimp 


Blood 


54 


Ch.3 


Clone A3.5 


Chimp 


Blood 


55 


Ch.4 


Clone A3. 7 


Chimp 


Blood 


56 



[0057] A novel NHP clone was made by splicing capsids fragments of two chimp adenoviruses into an AAV2 rep 
construct. This new clone, A3.1 , is also termed Ch.5 [SEQ ID NO:20]. Additionally, the present invention includes two 
human AAV sequences, termed H6 [SEQ ID NO:25] and H2 [SEQ ID NO:26]. 

15 [0058] The AAV nucleic acid sequences of the invention further encompass the strand which is complementary to 
the strands provided in the sequences provided in Fig. 1 and the Sequence Listing [SEQ ID NO:1, 9 - 59, 117-120], 
nucleic acid sequences, as well as the RNA and cDNA sequences corresponding to the sequences provided in Fig. 1 
and the Sequence Listing [SEQ ID NO:1, 9 - 59, 117-120], and their complementary strands. Also included in the 
nucleic acid sequences of the invention are natural variants and engineered modifications of the sequences of Figl 

20 and the Sequence Listing [SEQ ID NO:1 ,9-59, 117-120], and their complementary strands. Such modifications include, 
for example, labels which are known in the art, methylation, and substitution of one or more of the naturally occurring 
nucleotides with a degenerate nucleotide. 

[0059] Further included in this invention are nucleic acid sequences which are greater than 85%, preferably at least 
about 90%, more preferably at least about 95%, and most preferably at least about 98 to 99% identical or homologous 
25 to the sequences of the invention, including Fig. 1 and the Sequence Listing [SEQ ID NO:1, 9 - 59, 117-120]. These 
terms are as defined herein. 

[0060] Also included within the invention are fragments of the novel AAV sequences identified by the method de- 
scribed herein. Suitable fragments are at least 15 nucleotides in length, and encompass functional fragments, i.e., 
fragments which are of biological interest. In one embodiment, these fragments are fragments of the novel sequences 
30 of Fig. 1 and the Sequence Listing [SEQ ID N0:1, 9 - 59, 117-120], their complementary strands, cDNA and RNA 
complementary thereto. 

[0061] Examples of suitable fragments are provided with respect to the location of these fragments on AAV1 , AAV2, 
or AAV7. However, using the alignment provided herein (obtained using the Clustal W program at default settings), or 
similar techniques for generating an alignment with other novel serotypes of the invention, one of skill in the art can 

35 readily identify the precise nucleotide start and stop codons for desired fragments. 

[0062] Examples of suitable fragments include the sequences encoding the three variable proteins (vp) of the AAV 
capsid which are alternative splice variants: vpl [e.g., nt 825 to 3049 of AAV7, SEQ ID NO: 1]; vp2 [e.g., nt 1234 - 
3049 of AAV7, SEQ ID NO: 1]; and vp 3 [e.g., nt 1434 - 3049 of AAV7, SEQ ID NO:1]. It is notable that AAV7 has an 
unusual GTG start codon. With the exception of a few house-keeping genes, such a start codon has not previously 

40 been reported in DNA viruses. The start codons for vpl , vp2 and vp3 for other AAV serotypes have been believed to 
be such that they permit the cellular mechanism of the host cell in which they reside to produce vp1 , vp2 and vp3 in a 
ratio of 10%:10%:80%, respectively, in order to pemnit efficient assembly of the virion. However, the AAV7 virion has 
been found to assemble efficiently even with this rare GTG start codon. Thus, the inventors anticipate this it is desirable 
to alter the start codon of the vp3 of other AAV serotypes to contain this rare GTG start codon, in order to improve 

45 packaging efficiency, to alter the virion structure and/or to alter location of epitopes (e.g., neutralizing antibody epitopes) 
of other AAV serotypes. The start codons may be altered using conventional techniques including, e.g., site directed 
mutagenesis. Thus, the present invention encompasses altered AAV virions of any selected serotype, composed of a 
vp 3, and/or optionally, vp 1 and/or vp2 having start codons altered to GTG. 

[0063] Other suitable fragments of AAV, include a fragment containing the start codon for the AAV capsid protein [e. 

50 g., nt 468 to 3090 of AAV7, SEQ ID NO:1 , nt 725 to 3090 of AAV7, SEQ ID NO: 1 , and corresponding regions of the 
other AAV serotypes]. Still other fragments of AAV7 and the other novel AAV serotypes identified using the methods 
described herein include those encoding the rep proteins, including rep 78 [e.g., initiation codon 334 of Fig 1 for AAV7], 
rep 68 [initiation codon nt 334 of Fig. 1 for AAV7], rep 62 [initiation codon 1006 of Fig. 1 for AAV7], and rep 40 [initiation 
codon 1 006 of Fig. 1 for AAV7] Other fragments of interest may include the AAV 5' inverted terminal repeats ITRs, [nt 

55 1 to 107 of Fig. I for AAV7]; the AAV3' ITRs [nt 4704 to 4721 of Fig. 1 forAAV7], P1 9 sequences, AAV P40 sequences, 
the rep binding site, andthetenninal resolute site (TRS). Still other suitable fragments will be readily apparent to those 
of skill in the art. The corresponding regions in the other novel serotypes of the invention can be readily determined 
by reference to Figure 1 , or by utilizing conventional alignment techniques with the sequences provided herein. 
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[0064] In addition to including the nucleic acid sequences provided in the figures and Sequence Listing, the present 
invention includes nucleic acid molecules and sequences which are designed to express the amino acid sequences, 
proteins and peptides of the AAV serotypes of the invention. Thus, the invention includes nucleic acid sequences which 
encode the following novel AAV amino acid sequences: C1 [SEQ ID NO:60], C2 [SEQ ID NO:61], C5 [SEQ ID NO:62], 

5 A3-3 [SEQ ID NO:66], A3-7 [SEQ ID NO:67], A3-4 [SEQ ID NO:68], A3-5 [SEQ ID NO: 69], 3.3b [SEQ ID NO: 62], 
223.4 [SEQ ID NO: 73], 223-5 [SEQ ID NO:74], 223-10 [SEQ ID NO:75], 223-2 [SEQ ID NO:76], 223-7 [SEQ ID NO: 
77], 223-6 [SEQ ID NO: 78], 44-1 [SEQ ID NO: 79], 44-5 [SEQ ID NO:80], 44-2 [SEQ ID NO:81], 42-15 [SEQ ID NO: 
84], 42-8 [SEQ ID NO: 85], 42-13 [SEQ ID NO:86], 42-3A [SEQ ID NO:87], 42-4 [SEQ ID NO:88], 42-5A [SEQ ID NO: 
89], 42-1 B [SEQ ID NO:90], 42-5B [SEQ ID NO:91], 43-1 [SEQ ID NO: 92], 43-12 [SEQ ID NO: 93], 43-5 [SEQ ID NO: 

10 94], 43-21 [SEQ ID NO:96], 43-25 [SEQ ID NO: 97], 43-20 [SEQ ID NO:99], 24.1 [SEQ ID NO: 101], 42.2 [SEQ ID NO: 
102], 7.2 [SEQ ID NO: 103], 27.3 [SEQ ID NO: 104], 16.3 [SEQ ID NO: 105], 42.10 [SEQ ID NO: 106], 42-3B [SEQ ID 
NO: 107], 42-11 [SEQ ID NO: 108], F1 [SEQ ID NO: 109], F5 [SEQ ID NO: 110], F3 [SEQ ID NO:111], 42-6B[SEQ ID 
NO: 112], and/or 42-1 2 [SEQ ID NO: 113], and artificial AAV serotypes generated using these sequences and/or unique 
fragments thereof. 

15 [0065] As used herein, artificial AAV serotypes include, without limitation, AAV with a non-natu rally occurring capsid 
protein. Such an artificial capsid may be generated by any suitable technique, using a novel AAV sequence of the 
invention (e.g., a fragment of a vpl capsid protein) in combination with heterologous sequences which maybe obtained 
from another AAV serotype (known or novel), non-contiguous portions of the same AAV serotype, from a non-AAV viral 
source, or from a non-viral source. An artificial AAV serotype may be, without limitation, a chimeric AAV capsid, a 

20 recombinant AAV capsid, or a "humanized" AAV capsid. 

B. AAV Amino Acid Sequences, Proteins and Peptides 

[0066] The invention provides proteins and fragments thereof which are encoded by the nucleic acid sequences of 
25 the novel AAV serotypes identified herein, including, e.g., AAV7 [nt 825 to 3049 of AAV7, SEQ ID NO: 1] the other 
novel serotypes provided herein. Thus, the capsid proteins of the novel serotypes of the invention, including: H6 [SEQ 
ID NO: 25], H2 [SEQ ID NO: 26], 42-2 [SEQ ID NO:9], 42-8 [SEQ ID NO:27], 42-15 [SEQ ID NO:28], 42-5b [SEQ ID 
NO: 29], 42-1 b [SEQ ID NO:30]; 42-13 [SEQ ID NO: 31], 42-3a [SEQ ID NO: 32], 42-4 [SEQ ID NO:33], 42-5a [SEQ 
ID NO: 34], 42-10 [SEQ ID NO:35], 42-3b [SEQ ID NO: 36], 42-11 [SEQ ID NO: 37], 42-6b [SEQ ID NO:38], 43-1 [SEQ 
30 ID NO: 39], 43-5 [SEQ ID NO: 40], 43-12 [SEQ ID NO:41], 43-20 [SEQ ID NO:42], 43-21 [SEQ ID NO: 43], 43-23 [SEQ 
ID NO:44], 43-25 [SEQ ID NO: 45], 44.1 [SEQ ID NO:47], 44.5 [SEQ ID NO:47], 223.10 [SEQ ID NO:48], 223,2 [SEQ 
ID NO:49], 223.4 [SEQ ID NO:50], 223.5 [SEQ ID NO: 51], 223.6 [SEQ ID NO: 52], 223.7 [SEQ ID NO: 53], A3.4 [SEQ 
ID NO: 54], A3.5 [SEQ ID NO:55], A3.7 [SEQ ID NO: 56], A3.3 [SEQ ID NO:57], 42.12 [SEQ ID NO: 58], and 44.2 
[SEQ ID NO: 59], can be readily generated using conventional techniques from the open reading frames provided for 
35 the above-listed clones. 

[0067] The invention further encompasses AAV serotypes generated using sequences of the novel AAV serotypes 
of the invention, which are generated using synthetic, recombinant or other techniques known to those of skill in the 
art. The invention is not limited to novel AAV amino acid sequences, peptides and proteins expressed from the novel 
AAV nucleic acid sequences of the invention and encompasses amino acid sequences, peptides and proteins gener- 
ic ated by other methods known in the art, including, e.g., by chemical synthesis, by other synthetic techniques, or by 
other methods. For example, the sequences of any of 01 [SEQ ID NO:60], 02 [SEQ ID N0:61], 05 [SEQ ID NO:62], 
A3-3 [SEQ ID NO:66], A3-7 [SEQ ID NO:67], A3-4 [SEQ ID NO:68], A3-5 [SEQ ID NO: 69], 3.3b [SEQ ID NO: 62], 
223.4 [SEQ ID NO: 73], 223-5 [SEQ ID NO:74], 223-10 [SEQ ID NO:75], 223-2 [SEQ ID NO:76], 223-7 [SEQ ID NO: 
77], 223-6 [SEQ ID NO: 78], 44-1 [SEQ ID NO: 79], 44-5 [SEQ ID NO:80], 44-2 [SEQ ID NO:81], 42-15 [SEQ ID NO: 
45 84], 42-8 [SEQ ID NO: 85], 42-1 3 [SEQ ID NO:86], 42-3A [SEQ ID NO:87], 42-4 [SEQ ID NO:88], 42-5A [SEQ ID NO: 
89], 42-1B [SEQ ID NO:90], 42-5B [SEQ ID NO:91], 43-1 [SEQ ID NO: 92], 43-12 [SEQ ID NO: 93], 43-5 [SEQ ID NO: 
94], 43-21 [SEQ ID NO:96], 43-25 [SEQ ID NO: 97], 43-20 [SEQ ID NO:99], 24.1 [SEQ ID NO: 101], 42.2 [SEQ ID NO: 
102], 7.2 [SEQ ID NO: 103], 27.3 [SEQ ID NO: 104], 16.3 [SEQ ID NO: 105], 42.10 [SEQ ID NO: 106], 42-3B [SEQ ID 
NO: 107], 42-11 [SEQ ID NO: 108], F1 [SEQ ID NO: 109], F5 [SEQ ID NO: 110], F3 [SEQ ID NO:111], 42-6B [SEQ ID 
50 NO: 112], and/or 42-12 [SEQ ID NO: 113] by be readily generated using a variety of techniques. 

[0068] Suitable production techniques are well known to those of skill in the art. See, e.g., Sambrooket al. Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Press (Cold Spring Harbor, NY). Alternatively, peptides can also 
be synthesized by the well known solid phase peptide synthesis methods (Merrifield, J Am. Chem. Soc, 85:2149 
(1962); Stewart and Young, Solid Phase Peptide Synthesis (Freeman, San Francisco, 1969) pp. 27-62). These and 
55 other suitable production methods are within the knowledge of those of skill in the art and are not a limitation of the 
present invention. 

[0069] Particularly desirable proteins include the AAV capsid proteins, which are encoded by the nucleotide sequenc- 
es identified above. The sequences of many of the capsid proteins of the invention are provided in an alignment in Fig. 
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2 and/or in the Sequence Listing, SEQ ID NO: 2 and 60 to 115, wliicli is incorporated by reference herein. The AAV 
capsid is composed of three proteins, vpl , vp2 and vp3, which are alternative splice variants. The full-length sequence 
provided in these figures is that of vpl . Based on the nunnbering of the AAV7 capsid [SEQ ID NO:2], the sequences 
of vp2 span amino acid 138 - 737 of AAV7 and the sequences of vp3 span amino acids 203 - 737 of AAV7. With this 
5 infomiation, one of skill in the art can readily determine the location of the vp2 and vp3 proteins for the other novel 
serotypes of the invention. 

[0070] Other desirable proteins and fragments of the capsid protein include the constant and variable regions, located 
between hypervariable regions (HPV) and the sequences of the HPV regions themselves. An algorithm developed to 
determine areas of sequence divergence in AAV2 has yielded 12 hypervariable regions (HVR) of which 5 overlap or 

10 are part of the four previously described variable regions. [Chiorini etal, J. Virol, 73:1309-1 9 (1 999); Rutledge etal, J. 
ViroL, 72:309-319] Using this algorithm and/or the alignment techniques described herein, the HVR of the novel AAV 
serotypes are determined. For example, with respect to the number of the AAV2 vpl [SEQ ID NO:70], the HVR are 
located as follows: HVR1 , aa 1 46-1 52; HVR2, aa 1 82-1 86; HVR3, aa 262-264; HVR4, aa 381-383; HVR5, aa 450-474; 
HVR6, aa 490-495; HVR7, aa500-504; HVR8, aa 514-522; HVR9, aa 534-555; HVR10, aa 581-594; HVR11, aa 

15 658-667; and HVR12, aa 705-719. Utilizing an alignment prepared in accordance with conventional methods and the 
novel sequences provided herein [See, e.g.. Figure 2], one can readily determine the location of the HVR in the novel 
AAV serotypes of the invention. For example, utilizing Figure 2, one can readily determine that for AAV7 [SEQ ID NO: 
2]. HVR1 is located at aa 146 - 152; HVR2 is located at 182-187; HVR3 is located at aa 263-266, HVR4 is located at 
aa 383-385, HVR5 is located at aa 451 -475; HVR6 is located at aa 491 -496 of AAV7; HVR7 is located at aa 501 -505; 

20 HVR8 is located at aa 513-521 ; HVR9 is located at 533-554; HVR1 0 is located at aa 583-596; HVR11 is located at aa 
660-669; HVR12 is located at aa 707-721 . Using the information provided herein, the HVRsforthe other novel serotypes 
of the invention can be readily determined. 

[0071] In addition, within the capsid, amino acid cassettes of identity have been identified. These cassettes are of 
particular interest, as they are useful in constructing artificial serotypes, e.g. , by replacing a HVR1 cassette of a selected 

25 serotype with an HVR1 cassette of another serotype. Certain of these cassettes of identity are noted in Fig. 2. See, 
Fig. 2, providing the Clustal X alignment, which has a ruler is displayed below the sequences, starting at 1 for the first 
residue position. The line above the ruler is used to mark strongly conserved positions. Three characters (*, : , .) are 
used. "*" indicates positions which have a single, fully conserved residue. ":" indicates that a "strong" group is fully 
conserved "." Indicates that a "weaker" group is fully conserved. These are all the positively scoring groups that occur 

30 in the Gonnet Pam250 matrix. The strong groups are defined as a strong score >0.5 and the weak groups are defined 
as weak score <0.5. 

[0072] Additionally, examples of other suitable fragments of AAV capsids include, with respect to the numbering of 
AAV2 [SEQ ID NO:70], aa 24 - 42, aa 25 - 28; aa 81 - 85; aa133-165; aa 134 - 165; as 137-143; as 154-156; aa 
194-208; aa 261-274; aa 262-274; aa 171-173; aa 413-417; aa 449-478; aa 494-525; aa 534-571; aa 581-601; aa 

35 660-671 ; aa 709-723. Still other desirable fragments include, for example, in AAV7, amino acids 1 to 1 84 of SEQ ID 
N0:2, amino acids 1 99 to 259; amino acids 274 to 446; amino acids 603 to 659; amino acids 670 to 706; amino acids 
724 to 736; aa 185 to 198; aa 260 to 273; aa447 to 477; aa495 to 602; aa660 to 669; and aa707 to 723. Still other 
desirable regions, based on the numbering of AAV7 [SEQ ID NO:2], are selected from among the group consisting of 
aa 185 to 198; aa 260 to 273; aa447 to 477;aa495 to 602; aa660 to 669; and aa707 to 723. Using the alignment 

40 provided herein performed using the Clustal X program at default settings, or using other commercially or publicly 
available alignment programs at default settings, one of skill in the art can readily determine corresponding fragments 
of the novel AAV capsids of the invention. 

[0073] Other desirable proteins are the AAV rep proteins [aa 1 to 623 of SEQ ID NO:3 for AAV7] and functional 
fragments thereof, including, e.g., aa 1 to 171, aa 172 to 372, aa 373 to 444, aa 445 to 623 of SEQ ID NO:3, among 
45 others. Suitably, such fragments are at least 8 amino acids in length. See, Fig. 3. Comparable regions can be identified 
in the proteins of the other novel AAV of the invention, using the techniques described herein and those which are 
known in the art. In addition, fragments of other desired lengths may be readily utilized. Such fragments may be pro- 
duced recombinantly or by other suitable means, e.g., chemical synthesis. 

[0074] The sequences, proteins, and fragments of the invention may be produced by any suitable means, including 
50 recombinant production, chemical synthesis, or other synthetic means. Such production methods are with in the knowl- 
edge of those of skill in the art and are not a limitation of the present invention. 

IV. Production of rAAV with novel AAV capsids 

55 [0075] The invention encompasses novel, wild-type AAV serotypes identified by the invention, the sequences of 
which wild-type AAV serotypes are free of DNA and/or cellular material with these viruses are associated in nature. In 

another aspect, the present invention provides molecules which utilize the novel AAV sequences of the invention, 
including fragments thereof, for production of molecules useful in delivery of a heterologous gene or other nucleic acid 
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sequences to a target cell. 

[0076] The molecules of the invention which contain sequences of a novel AAV serotype of the invention include 
any genetic element (vector) which may be delivered to a host cell, e.g., naked DNA, a plasmid, phage, transposon, 
cosmid, episome, a protein in a non-viral delivery vehicle (e.g., a lipid-based carrier), virus, etc. which transfer the 

5 sequences carried thereon. The selected vector may be delivered by any suitable method, including transfection, elec- 
troporation, liposome delivery, membrane fusion techniques, high velocity DNA-coated pellets, viral infection and pro- 
toplast fusion. The methods used to construct any embodiment of this invention are known to those with skill in nucleic 
acid manipulation and include genetic engineering, recombinant engineering, and synthetic techniques. See, e.g., 
Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor NY. 

10 [0077] In one embodiment, the vectors of the invention contain sequences encoding a novel AAV capsid of the 
invention (e.g., AAV7 capsid, AAV 44-2 (rh.10), an AAV10 capsid, an AAV11 capsid, an AAV1 2 capsid), or a fragment 
of one or more of these AAV capsids. Alternatively, the vectors may contain the capsid protein, or a fragment thereof, 
itself. 

[0078] Optionally, vectors of the invention may contain sequences encoding AAV rep proteins. Such rep sequences 
15 may be from the same AAV serotype which is providing the cap sequences. Alternatively, the present invention provides 
vectors in which the rep sequences are from an AAV serotype which differs from that which is providing the cap se- 
quences. In one embodiment, the repand capsequences are expressed from separate sources (e.g., separate vectors, 
or a host cell and a vector). In another embodiment, these rep sequences are expressed from the same source as the 
cap sequences. In this embodiment, the rep sequences may be fused in frame to cap sequences of a different AAV 
20 serotype to form a chimeric AAV vector. Optionally, the vectors of the invention further contain a minigene comprising 
a selected transgene which is flanked by AAV 5' ITR and AAV 3' ITR. 

[0079] Thus, in one embodiment, the vectors described herein contain nucleic acid sequences encoding an intact 
AAV capsid which may be from a single AAV serotype (e.g., AAV7 or another novel AAV). Alternatively, these vectors 
contain sequences encoding artificial capsids which contain one or more fragments of the AAV7 (or another novel AAV) 

25 capsid fused to heterologous AAV or non-AAV capsid proteins (or fragments thereof). These artificial capsid proteins 
are selected from non-contiguous portions of the AAV7 (or another novel AAV) capsid or from capsids of other AAV 
serotypes. For example, it may be desirable to modify the coding regions of one or more of the AAV vp1 , e.g., in one 
or more of the hypervariable regions (i.e., HPV1-12), or vp2, and/or vp3, In another example, it may be desirable to 
alter the start codon of the vp3 protein to GTG. These modifications may be to increase expression, yield, and/or to 

30 improve purification in the selected expression systems, or for another desired purpose (e.g., to change tropism or 
alter neutralizing antibody epitopes). 

[0080] The vectors described herein, e.g., a plasmid, are useful for a variety of purposes, but are particularly well 
suited for use in production of a rAAV containing a capsid comprising AAV sequences or a fragment thereof. These 
vectors, including rAAV, their elements, construction, and uses are described in detail herein. 

35 [0081] In one aspect, the invention provides a method of generating a recombinant adeno-associated virus (AAV) 
having an AAV serotype 7 (or another novel AAV) capsid, or a portion thereof. Such a method involves culturing a host 
cell which contains a nucleic acid sequence encoding an adeno-associated virus (AAV) serotype 7 (or another novel 
AAV) capsid protein, or fragment thereof, as defined herein; a functional rep gene; a minigene composed of, at a 
minimum, AAV inverted terminal repeats (ITRs) and a transgene; and sufficient helper functions to permit packaging 

40 of the minigene into the AAV7 (or another novel AAV) capsid protein. 

[0082] The components required to be cultured in the host cell to package an AAV minigene in an AAV capsid may 
be provided to the host cell in trans. Alternatively, any one or more of the required components (e.g., minigene, rep 
sequences, cap sequences, and/or helper functions) may be provided by a stable host cell which has been engineered 
to contain one or more of the required components using methods known to those of skill in the art. Most suitably, such 

45 a stable host cell will contain the required component(s) under the control of an inducible promoter. However, the 
required component(s) may be under the control of a constitutive promoter. Examples of suitable inducible and con- 
stitutive promoters are provided herein, in the discussion of regulatory elements suitable for use with the transgene. 
In still another alternative, a selected stable host cell may contain selected component(s) under the control of a con- 
stitutive promoter and other selected component(s) under the control of one or more inducible promoters. For example, 

50 a stable host cell may be generated which is derived from 293 cells (which contain E1 helperfunctions underthe control 
of a constitutive promoter), but which contains the rep and/or cap proteins underthe control of inducible promoters. 
Still other stable host cells may be generated by one of skill in the art. 

[0083] The minigene, rep sequences, cap sequences, and helper functions required for producing the rAAV of the 
invention may be delivered to the packaging host cell in the form of any genetic element which transfer the sequences 
55 carried thereon. The selected genetic element may be delivered by any suitable method, including those described 
herein. The methods used to construct any embodiment of this Invention are known to those with skill in nucleic acid 
manipulation and include genetic engineering, recombinant engineering, and synthetic techniques. See, e.g., Sam- 
brook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, NY. Similarly, 
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methods of generating rAAV virions are well known and the selection of a suitable method is not a limitation on the 
present invention. See, e.g., K. Fisher et al, J. Virol. , 70:520-532 (1 993) and US Patent 5,478,745. 

A. The Minigene 

5 

[0084] The minigene is composed of, at a minimum, a transgene and its regulatory sequences, and 5' and 3' AAV 
inverted terminal repeats (ITRs). It is this minigene which is packaged into a capsid protein and delivered to a selected 
host cell. 

10 1 . The transgene 

[0085] The transgene is a nucleic acid sequence, heterologous to the vector sequences flanking the transgene, 

which encodes a polypeptide, protein, or other product, of interest. The nucleic acid coding sequence is operatively 
linked to regulatory components in a manner which permits transgene transcription, translation, and/or expression in 
15 a host cell. 

[0086] The composition ofthe transgene sequence will depend upon the use to which the resulting vector will be put. 
For example, one type of transgene sequence includes a reporter sequence, which upon expression produces a de- 
tectable signal. Such reporter sequences include, without limitation, DNA sequences encoding p-lactamase, p-galac- 
tosidase (LacZ), alkaline phosphatase, thymidine kinase, green fluorescent protein (GFP), chloramphenicol acetyl- 
20 transferase (CAT), luciferase, membrane bound proteins including, for example, CD2, CD4, CDS, the influenza he- 
magglutinin protein, and others well known in the art, to which high affinity antibodies directed thereto exist or can be 
produced by conventional means, and fusion proteins comprising a membrane bound protein appropriately fused to 
an antigen tag domain from, among others, hemagglutinin or Myc. 

[0087] These coding sequences, when associated with regulatory elements which drive their expression, provide 

25 signals detectable by conventional means, including enzymatic, radiographic, colorimetric, fluorescence or other spec- 
trographic assays, fluorescent activating cell sorting assays and immunological assays, including enzyme linked im- 
munosorbent assay (ELISA), radioimmunoassay (RIA) and immunohistochemistry. For example, where the marker 
sequence is the LacZ gene, the presence ofthe vector carrying the signal is detected by assays for beta-galactosidase 
activity. Where the transgene is green fluorescent protein or luciferase, the vector carrying the signal may be measured 

30 visually by color or light production in a luminometer. 

[0088] However, desirably, the transgene is a non-marker sequence encoding a product which is useful in biology 
and medicine, such as proteins, peptides, RNA, enzymes, or catalytic RNAs. Desirable RNA molecules include tRNA, 
dsRNA, ribosomal RNA, catalytic RNAs, and antisense RNAs. One example of a useful RNA sequence is a sequence 
which extinguishes expression of a targeted nucleic acid sequence in the treated animal. 

35 [0089] The transgene may be used to correct or ameliorate gene deficiencies, which may include deficiencies in 
which normal genes are expressed at less than normal levels or deficiencies in which the functional gene product is 
not expressed. A preferred type of transgene sequence encodes atherapeutic protein or polypeptide which is expressed 
in a host cell. The invention further includes using multiple transgenes, e.g., to correct or ameliorate a gene defect 
caused by a multi-subunit protein. In certain situations, a different transgene may be used to encode each subunit of 

40 a protein, or to encode different peptides or proteins. This is desirable when the size of the DNA encoding the protein 
subunit is large, e.g., for an immunoglobulin, the platelet-derived growth factor, or a dystrophin protein. In order forthe 
cell to produce the multi-subunit protein, a cell is infected with the recombinant virus containing each of the different 
subunits. Alternatively, different subunits of a protein may be encoded by the same transgene. In this case, a single 
transgene includes the DNA encoding each of the subunits, with the DNA for each subunit separated by an internal 

45 ribozyme entry site (IRES). This is desirable when the size of the DNA encoding each of the subunits is small, e.g., 
the total size ofthe DNA encoding the subunits and the IRES is less than five kilobases. As an alternative to an IRES, 
the DNA may be separated by sequences encoding a 2A peptide, which self-cleaves in a post-translational event. See, 
e.g., M.L. Donnelly et al, J. Gen. Virol., 78(Pt 1):13-21 (Jan 1997); Furler, S., et al, Gene Then, 8(11):864-873 (June 
2001); Klump H., etaL, Gene Then, 8(10):811-817 (May 2001). This 2A peptide is significantly smaller than an IRES, 

50 making it well suited for use when space is a limiting factor. However, the selected transgene may encode any biolog- 
ically active product or other product, e.g., a product desirable for study. 

[0090] Suitable transgenes may be readily selected by one of skill in the art. The selection of the transgene is not 
considered to be a limitation of this invention. 

55 2. Regulatory Elements 

[0091] In addition to the major elements identified above for the minigene, the vector also includes conventional 
control elements necessary which are operably linked to the transgene in a manner which permits its transcription. 
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translation and/or expression in a cell transfected with the plasmid vector or infected with the virus produced by the 
invention. As used herein, "operably linked" sequences include both expression control sequences that are contiguous 
with the gene of interest and expression control sequences that act in trans or at a distance to control the gene of interest. 
[0092] Expression control sequences include appropriate transcription initiation, termination, promoter and enhancer 

5 sequences; efficient RNA processing signals such as splicing and polyadenylation (polyA) signals; sequences that 
stabilize cytoplasmic mRNA; sequences that enhance translation efficiency (i.e., Kozak consensus sequence); se- 
quences that enhance protein stability; and when desired, sequences that enhance secretion of the encoded product. 
A great number of expression control sequences, including promoters which are native, constitutive, inducible and/or 
tissue-specific, are known in the art and may be utilized. 

10 [0093] Examples of constitutive promoters include, without limitation, the retroviral Rous sarcoma virus (RSV) LTR 
promoter (optionally with the RSV enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV enhancer) 
[see, e.g., Boshart et al, Cell, 41:521-530 (1985)], the SV40 promoter, the dihydrofolate reductase promoter, the p- 
actin promoter, the phosphoglycerol kinase (PGK) promoter, and the EFIa promoter [Invitrogen]. 
[0094] Inducible promoters allow regulation of gene expression and can be regulated by exogenously supplied com- 

15 pounds, environmental factors such as temperature, orthe presence of a specific physiological state, e.g., acute phase, 
a particular differentiation state of the cell, or in replicating cells only. Inducible promoters and inducible systems are 
available from a variety of commercial sources, including, without limitation, Invitrogen, Clontech and Ariad. Many other 
systems have been described and can be readily selected by one of skill in the art. Examples of inducible promoters 
regulated by exogenously supplied promoters include the zinc-inducible sheep metallothionine (IVIT) promoter, the 

20 dexamethasone (Dex)-inducible mouse mammary tumor virus (l\/ll\/ITV) promoter, theTZ polymerase promoter system 
[WO 98/1 0088]; the ecdysone insect promoter [No etal, Proc. Natl. Acad. Sci. USA, 93:3346-3351 (1996)], the tetra- 
cycline-repressible system [Gossen etal, Proc. Natl. Acad. Scl. USA, 89:5547-5551 (1992)], the tetracycline-inducible 
system [Gossen et al, Science, 268:1766-1769 (1995), see also Harvey et al, Cum Opin. Chem. Biol., 2:512-518 
(1998)], the RU486-inducible system [Wang etal, Nat. Biotech., 15:239-243 (1997) and Wang etal, Gene Then, 4: 

25 432-441 (1 997)] and the rapamycin-inducible system [Magari et al, J. Clin. Invest, 100:2865-2872 (1 997)]. Still other 
types of inducible promoters which maybe useful in this context are those which are regulated by a specific physiological 
state, e.g., temperature, acute phase, a particular differentiation state of the cell, or in replicating cells only. 
[0095] In another embodiment, the native promoter for the transgene will be used. The native promoter may be 
preferred when it is desired that expression of the transgene should mimic the native expression. The native promoter 

30 may be used when expression of the transgene must be regulated temporally or developmentally, or in a tissue-specific 
manner, or in response to specific transcriptional stimuli. In a further embodiment, other native expression control 
elements, such as enhancer elements, polyadenylation sites or Kozak consensus sequences may also be used to 
mimic the native expression. 

[0096] Another embodiment of the transgene includes a transgene operably linked to a tissue-specific promoter. For 

35 instance, if expression in skeletal muscle is desired, a promoter active in muscle should be used. These include the 
promoters from genes encoding skeletal (3-actin, myosin light chain 2A, dystrophin, muscle creatine kinase, as well as 
synthetic muscle promoters with activities higher than naturally-occurring promoters (see Li et al., Nat. Biotech., 17: 
241 -245 (1 999)). Examples of promoters that are tissue-specific are known for liver (albumin, Miyatake etal., J. Virol., 
71 :51 24-32 (1 997); hepatitis B virus core promoter, Sandig etal., Gene Then, 3:1 002-9 (1 996); alpha-fetoprotein (AFP), 

40 Arbuthnot etal.. Hum. Gene T/ier., 7:1503-14 (1 996)), bone osteocalcin (Stein etal., Mol. Biol. Rep., 24:185-96 (1997)); 
bone sialoprotein (Chen etal., J. Bone Minen Res., 11:654-64 (1996)), lymphocytes (CD2, Hansal etal., J Immunol., 
161:1063-8 (1998); immunoglobulin heavy chain; Tcell receptor a chain), neuronal such as neuron-specific enolase 
(NSE) promoter (Andersen etal.. Cell. Mol. Neurobiol., 13:503-15 (1993)), neurofilament light-chain gene (Piccioli et 
al., Proc. Natl. Acad. Sci. USA, 88:5611-5 (1991)), and the neuron-specific vgf gene (Piccioli etal., Neuron, 15:373-84 

45 (1995)), among others. 

[0097] Optionally, plasmids carrying therapeutically useful transgenes may also include selectable markers or re- 
porter genes may include sequences encoding geneticin, hygromicin or purimycin resistance, among others. Such 
selectable reporters or marker genes (preferably located outside the viral genome to be rescued by the method of the 
invention) can be used to signal the presence of the plasmids in bacterial cells, such as ampicillin resistance. Other 

50 components of the plasmid may include an origin of replication. Selection of these and other promoters and vector 
elements are conventional and many such sequences are available [see, e.g., Sambrook et al, and references cited 
therein]. 

[0098] The combination of the transgene, promoter/enhancer, and 5' and 3' ITRs is referred to as a "minigene" for 
ease of reference herein. Provided with the teachings of this invention, the design of such a minigene can be made 
55 by resort to conventional techniques. 
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3. Delivery of the Minigene to a Packaging Host Cell 

[0099] The minigene can be carried on any suitable vector, e.g., a plasmid, which is delivered to a host cell. The 
plasnnids useful in this invention may be engineered such thatthey are suitable for replication and, optionally, integration 

5 in prokaryotic cells, mammalian cells, or both. These plasmids (or other vectors carrying the 5' AAV ITR-heterologous 
molecule-3'ITR) contain sequences permitting replication of the minigene in eukaryotes and/or prokaryotes and selec- 
tion markers for these systems. Selectable markers or reporter genes may include sequences encoding geneticin, 
hygromicin or purimycin resistance, among others. The plasmids may also contain certain selectable reporters or 
marker genes that can be used to signal the presence of the vector in bacterial cells, such as ampicillin resistance. 

10 Other components of the plasmid may include an origin of replication and an amplicon, such as the amplicon system 
employing the Epstein Barr virus nuclear antigen. This amplicon system, or other similar amplicon components permit 
high copy episomal replication in the cells. Preferably, the molecule carrying the minigene is transfected into the cell, 
where it may exist transiently. Alternatively, the minigene (carrying the 5' AAV ITR-heterologous molecule-3' ITR) may 
be stably integrated into the genome of the host cell, either chromosomally or as an episome. In certain embodiments, 

15 the minigene may be present in multiple copies, optionally in head-to-head, head-to-tail, or tail-to-tail concatamers. 
Suitable transfection techniques are known and may readily be utilized to deliver the minigene to the host cell. 
[0100] Generally, when delivering the vector comprising the minigene by transfection, the vector is delivered in an 
amount from about 5 ja.g to about 1 00 jag DNA, and preferably about 1 0 to about 50 jig DNA to about 1 x 1 0^ cells to 
about 1 X lO"'^ cells, and preferably about 10^ cells. However, the relative amounts of vector DNA to host cells may 

20 be adjusted, taking into consideration such factors as the selected vector, the delivery method and the host cells se- 
lected. 

B. Rep and Cap Sequences 

25 [0101] In addition to the minigene, the host cell contains the sequences which drive expression of the novel AAV 

capsid protein (e.g., AAV7 or other novel AAV capsid or an artificial capsid protein comprising a fragment of one or 
more of these capsids) in the host cell and rep sequences of the same serotype as the serotype of the AAV ITRs found 
in the minigene. The AAV cap and rep sequences may be independently obtained from an AAV source as described 
above and may be introduced into the host cell in any manner known to one in the art as described above. Additionally, 

30 when pseudotyping a novel AAV capsid of the invention, the sequences encoding each of the essential rep proteins 
may be supplied by the same AAV serotype, or the sequences encoding the rep proteins may be supplied by different 
AAV serotypes (e.g., AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, or one of the novel serotypes identified herein). For 
example, the rep78/68 sequences may be from AAV2, whereas the rep52/40 sequences may from AAV1 . 
[0102] In one embodiment, the host cell stably contains the capsid protein under the control of a suitable promoter, 

35 such as those described above. IVost desirably, in this embodiment, the capsid protein is expressed under the control 
of an inducible promoter. In another embodiment, the capsid protein is supplied to the host cell in trans. When delivered 
to the host cell in trans, the capsid protein may be delivered via a plasmid which contains the sequences necessary 
to direct expression of the selected capsid protein in the host cell. Most desirably, when delivered to the host cell in 
trans, the plasmid carrying the capsid protein also carries other sequences required for packaging the rAAV, e.g., the 

40 rep sequences. 

[0103] In another embodiment, the host cell stably contains the rep sequences under the control of a suitable pro- 
moter, such as those described above. Most desirably, in this embodiment, the essential rep proteins are expressed 
under the control of an inducible promoter. In another embodiment, the rep proteins are supplied to the host cell in 
trans. When delivered to the host cell in trans, the rep proteins may be delivered via a plasmid which contains the 
45 sequences necessary to direct expression of the selected rep proteins in the host cell. Most desirably, when delivered 
to the host cell in trans, the plasmid carrying the capsid protein also carries other sequences required for packaging 
the rAAV, e.g., the rep and cap sequences. 

[0104] Thus, in one embodiment, the rep and cap sequences may be transfected into the host cell on a single nucleic 
acid molecule and exist stably in the cell as an episome. In another embodiment, the rep and cap sequences are stably 
50 integrated into the genome of the cell. Another embodiment has the rep and cap sequences transiently expressed in 
the host cell. For example, a useful nucleic acid molecule for such transfection comprises, from 5' to 3', a promoter, 
an optional spacer interposed between the promoter and the start site of the rep gene sequence, an AAV rep gene 
sequence, and an AAV cap gene sequence. 

[0105] Optionally, the rep and/or cap sequences may be supplied on a vector that contains other DNA sequences 
55 that are to be introduced into the host cells. For instance, the vector may contain the rAAV construct comprising the 
minigene. The vector may comprise one or more of the genes encoding the helper functions, e.g., the adenoviral 

proteins E1 , E2a, and E40RF6, and the gene for VAI RNA. 

[0106] Preferably, the promoter used in this construct may be any of the constitutive, inducible or native promoters 
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known to one of skill in the art or as discussed above. In one embodinnent, an AAV P5 pronnoter sequence is ennployed. 
The selection of the AAV to provide any of these sequences does not limit the invention. 

[0107] In another preferred embodiment, the promoter for rep is an inducible promoter, as are discussed above in 
connection with the transgene regulatory elements. One preferred promoter for rep expression is the T7 promoter. The 

5 vector comprising the rep gene regulated by the T7 promoter and the cap gene, is transfected or transformed into a 
cell which either constitutively or inducibly expresses the T7 polymerase. See WO 98/10088, published March 12, 1998. 
[0108] The spacer is an optional element in the design of the vector. The spacer is a DNA sequence interposed 
between the promoter and the rep gene ATG start site. The spacer may have any desired design; that is, it may be a 
random sequence of nucleotides, or alternatively, it may encode a gene product, such as a marker gene. The spacer 

10 may contain genes which typically incorporate start/stop and polyA sites. The spacer may be a non-coding DNA se- 
quence from a prokaryote or eukaryote, a repetitive non-coding sequence, a coding sequence without transcriptional 
controls or a coding sequence with transcriptional controls. Two exemplary sources of spacer sequences are the X 
phage ladder sequences or yeast ladder sequences, which are available commercially, e.g., from Gibco or Invitrogen, 
among others. The spacer may be of any size sufficient to reduce expression of the replS and rep68 gene products, 

f5 leaving the rep52., rep40 and cap gene products expressed at normal levels. The length of the spacer may therefore 
range from about 10 bp to about 10.0 kbp, preferably in the range of about 100 bp to about 8.0 kbp. To reduce the 
possibility of recombination, the spacer is preferably less than 2 kbp in length; however, the invention is not so limited. 
[0109] Although the molecule(s) providing rep and capmay exist in the hostcelltransiently (i.e., through transfection), 
it is preferred that one or both of the rep and cap proteins and the promoter(s) controlling their expression be stably 

20 expressed in the host cell, e.g., as an episome or by integration into the chromosome of the host cell. The methods 
employed for constructing embodiments of this invention are conventional genetic engineering or recombinant engi- 
neering techniques such as those described in the references above. While this specification provides illustrative ex- 
amples of specific constructs, using the information provided herein, one of skill in the art may select and design other 
suitable constructs, using a choice of spacers, P5 promoters, and other elements, including at least one translational 

25 start and stop signal, and the optional addition of polyadenylation sites. 

[0110] In another embodiment of this invention, the rep or cap protein may be provided stably by a host cell. 

0. The Helper Functions 

30 [0111] The packaging host cell also requires helperf unctions in order to package the rAAV of the invention. Optionally, 
these functions may be supplied by a herpesvirus. Most desirably, the necessary helper functions are each provided 
from a human or non-human primate adenovirus source, such as those described above and/or are available from a 
variety of sources, including the American Type Culture Collection (ATCC), Manassas, VA (US). In one currently pre- 
ferred embodiment, the host cell is provided with and/or contains an El a gene product, an El b gene product, an E2a 

35 gene product, and/or an E4 ORF6 gene product. The host cell may contain other adenoviral genes such as VAI RNA, 
but these genes are not required. In a preferred embodiment, no other adenovirus genes or gene functions are present 
in the host cell. 

[0112] By "adenoviral DNA which expresses the El a gene product", it is meant any adenovirus sequence encoding 
E1a or any functional E1a portion. Adenoviral DNA which expresses the E2a gene product and adenoviral DNA which 

40 expresses the E4 ORF6 gene products are defined similarly. Also included are any alleles or other modifications of 
the adenoviral gene or functional portion thereof. Such modifications may be deliberately introduced by resort to con- 
ventional genetic engineering or mutagenic techniques to enhance the adenoviral function in some manner, as well 
as naturally occurring allelic variants thereof. Such modifications and methods for manipulating DNA to achieve these 
adenovirus gene functions are known to those of skill in the art. 

45 [0113] The adenovirus E1 a, Elb, E2a, and/or E40RF6 gene products, as well as any other desired helper functions, 
can be provided using any means that allows their expression in a cell. Each of the sequences encoding these products 
may be on a separate vector, or one or more genes may be on the same vector. The vector may be any vector known 
in the art or disclosed above, including plasmids, cosmids and viruses. Introduction into the host cell of the vector may 
be achieved by any means known in the art or as disclosed above, including transfection, infection, elect roporation, 

50 liposome delivery, membrane fusion techniques, high velocity DNA-coated pellets, viral infection and protoplast fusion, 
among others. One or more of the adenoviral genes may be stably integrated into the genome of the host cell, stably 
expressed as episomes, or expressed transiently. The gene products may all be expressed transiently, on an episome 
or stably integrated, or some of the gene products may be expressed stably while others are expressed transiently. 
Furthermore, the promoters for each of the adenoviral genes may be selected independently from a constitutive pro- 

55 meter, an inducible promoter or a native adenoviral promoter. The promoters may be regulated by a specific physio- 
logical state of the organism or eel I (i.e., by the differentiation state or in replicating or quiescent cells) or by exogenously- 
added factors, for example. 
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D. Host Cells And Packaging Cell Lines 

[0114] The host cell itself may be selected from any biological organism, including prokaryotic (e.g., bacterial) cells, 
and eukaryotic cells, including, insect cells, yeast cells and mammalian cells. Particularly desirable host cells are se- 

5 lectedfrom among any mammalian species, including, without limitation, cells such as A549, WEHI, 3T3, 1 0T1/2, BHK, 
MDCK, COS 1 , COS 7, BSC 1 , BSC 40, BMT 1 0, VERO, WI38, HeLa, 293 cells (which express functional adenoviral 
E1), Saos, C2C12, L cells, HT1080, HepG2 and primary fibroblast, hepatocyte and myoblast cells derived from mam- 
mals including human, monkey, mouse, rat, rabbit, and hamster. The selection of the mammalian species providing 
the cells is not a limitation of this invention; nor is the type of mammalian cell, i.e., fibroblast, hepatocyte, tumor cell, 

10 etc. The most desirable cells do not carry any adenovirus gene other than E1 , E2a and/or E4 ORF6; nor do they contain 
any other virus gene which could result in homologous recombination of a contaminating virus during the production 
of rAAV; and it is capable of infection or transfection of DNA and expression of the transfected DNA. In a preferred 
embodiment, the host cell is one that has rep and cap stably transfected in the cell. 

[0115] One host cell useful in the present invention is a host cell stably transformed with the sequences encoding 

f5 rep and cap, and which is transfected with the adenovirus E1, E2a, and E40RF6 DNA and a construct carrying the 
minigene as described above. Stable rep and/or cap expressing cell lines, such as B-50 (PCT/US98/19463), or those 
described in U.S. Patent No. 5,658,785, may also be similarly employed. Another desirable host cell contains the 
minimum adenoviral DNA which is sufficient to express E4 ORF6. Yet other cell lines can be constructed using the 
novel AAV rep and/or novel AAV cap sequences of the invention. 

20 [0116] The preparation of a host cell according to this invention involves techniques such as assembly of selected 
DNA sequences. This assembly may be accomplished utilizing conventional techniques. Such techniques include 
cDNAand genomic cloning, which are well known and are described in Sambrooket al., cited above, use of overlapping 
oligonucleotide sequences of the adenovirus and AAV genomes, combined with polymerase chain reaction, synthetic 
methods, and any other suitable methods which provide the desired nucleotide sequence. 

25 [0117] Introduction of the molecules (as plasmids or viruses) into the host cell may also be accomplished using 
techniques known to the skilled artisan and as discussed throughoutthe specification. In preferred embodiment, stand- 
ard transfection techniques are used, e.g., CaP04 transfection or electroporation, and/or infection by hybrid adenovirus/ 
AAV vectors into cell lines such as the human embryonic kidney cell line H EK 293 (a human kidney cell line containing 
functional adenovirus E1 genes which provides trans-acWng E1 proteins). 

30 [0118] These novel AAV-based vectors which are generated by one of skill in the art are beneficial for gene delivery 
to selected host cells and gene therapy patients since no neutralization antibodies to AAV7 have been found in the 
human population. Further, early studies show no neutralizing antibodies in cyno monkey and chimpanzee populations, 
and less than 15% cross-reactivity of AAV 7 in rhesus monkeys, the species from which the serotype was isolated. 
One of skill in the art may readily prepare other rAAV viral vectors containing the AAV7 capsid proteins provided herein 

35 using a variety of techniques known to those of skill in the art. One may similarly prepare still other rAAV viral vectors 
containing AAV7 sequence and AAV capsids of another serotype. Similar advantages are conferred by the vectors 
based on the other novel AAV of the invention. 

[0119] Thus, one of skill in the art will readily understand that the AAV7 sequences of the invention can be readily 

adapted for use in these and other viral vector systems for in vitro, ex vivo or in vivo gene delivery. Similarly, one of 
40 skill in the art can readily select other fragments of the novel AAV genome of the invention for use in a variety of rAAV 
and non-rAAV vector systems. Such vectors systems may include, e.g., lentiviruses, retroviruses, poxviruses, vaccinia 
viruses, and adenoviral systems, among others. Selection of these vector systems is not a limitation of the present 
invention. 

[0120] Thus, the invention further provides vectors generated using the nucleic acid and amino acid sequences of 
45 the novel AAV of the invention. Such vectors are useful for a variety of purposes, including for delivery of therapeutic 
molecules and for use in vaccine regimens. Particularly desirable for delivery of therapeutic molecules are recombinant 
AAV containing capsids of the novel AAV of the invention. These, or other vector constructs containing novel AAV 
sequences of the invention may be used in vaccine regimens, e.g., for co-delivery of a cytokine, or for delivery of the 
immunogen itself. 

50 

V. Recombinant Viruses And Uses Thereof 

[0121] Using the techniques described herein, one of skill in the art may generate a rAAV having a capsid of a novel 
serotype of the Invention, or a novel capsid containing one or more novel fragments of an AAV serotype identified by 
55 the method of the invention. In one embodiment, a full-length capsid from a single serotype, e.g., AAV7 [SEQ ID NO: 
2] can be utilized. In another embodiment, a full-length capsid may be generated which contains one or more fragments 
of a novel serotype of the invention fused in frame with sequences from another selected AAV serotype. For example, 
a rAAV may contain one or more of the novel hypervariable region sequences of an AAV serotype of the invention. 
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Alternatively, the unique AAV serotypes of the invention may be used in constructs containing other viral or non-viral 
sequences. 

[0122] It will be readily apparent to one of skill in the art one ennbodiment, that certain serotypes of the invention will 
be particularly well suited for certain uses. For example, vectors based on AAVZcapsids of the invention are particularly 
5 well suited for use in muscle; whereas vectors based on rh.10 (44-2) capsids of the invention are particularly well suited 
for use in lung. Uses of such vectors are not so limited and one of skill in the art may utilize these vectors for delivery 
to other cell types, tissues or organs. Further, vectors based upon other capsids of the invention may be used for 
delivery to these or other cells, tissues or organs. 

10 A. Delivery of Transgene 

[0123] In another aspect, the present invention provides a method for delivery of a transgene to a host which involves 

transfecting or infecting a selected host cell with a vector generated with the sequences of the AAV of the invention. 
Methods for delivery are well known to those of skill in the art and are not a limitation of the present invention. 

f5 [0124] In one desirable embodiment, the invention provides a method for AAV-mediated delivery of a transgene to 
a host. This method involves transfecting or infecting a selected host cell with a recombinant viral vector containing a 
selected transgene under the control of sequences which direct expression thereof and AAV capsid proteins. 
[0125] Optionally, a sample from the host may be first assayed for the presence of antibodies to a selected AAV 
serotype. A variety of assay formats for detecting neutralizing antibodies are well known to those of skill in the art. The 

20 selection of such an assay is not a limitation of the present invention. See, e.g.. Fisher et al. Nature Med., 3(3):306-312 
(March 1997) and W. C. Manning et al. Human Gene Therapy, 9:477-485 (March 1, 1998). The results of this assay 
may be used to determine which AAV vector containing capsid proteins of a particular serotype are preferred for delivery, 
e.g., by the absence of neutralizing antibodies specific for that capsid serotype. 

[0126] In one aspect of this method, the delivery of vector with a selected AAV capsid proteins may precede or follow 

25 delivery of a gene via a vector with a different serotype AAV capsid protein. Similarty, the delivery of vector with other 
novel AAV capsid proteins of the invention may precede or follow delivery of a gene via a vector with a different serotype 
AAV capsid protein. Thus, gene delivery via rAAV vectors may be used for repeat gene delivery to a selected host cell. 
Desirably, subsequently administered rAAV vectors carry the same transgene as the first rAAV vector, but the subse- 
quently administered vectors contain capsid proteins of serotypes which differ from the first vector. For example, if a 

30 first vector has AAV7 capsid proteins [SEQ ID NO:2], subsequently administered vectors may have capsid proteins 
selected from among the other serotypes, including AAV1 , AAV2, AAV3A, AAV3B, AAV4, AAV6, AAV10, AAV11 , and 
AAV1 2, or any of the other novel AAV capsids identified herein including, without limitation: A3.1 , H2, H6, CI , C2, C5, 
A3-3, A3-7, A3-4, A3-5, 3.3b, 223.4, 223-5, 223-10, 223-2, 223-7, 223-6, 44-1 , 44-5, 44-2, 42-15, 42-8, 42-13, 42-3A, 
42-4, 42-5A, 42-1B, 42-5B, 43-1, 43-12, 43-5, 43-21, 43-25, 43-20, 24.1, 42.2, 7.2, 27.3, 16.3, 42.10, 42-3B, 42-11, 

35 F1 , F5, F3, 42-6B, and/or 42-1 2. 

[0127] The above-described recombinant vectors may be delivered to host cells according to published methods. 
The rAAV, preferably suspended in a physiologically compatible carrier, may be administered to a human or non-human 
mammalian patient. Suitable carriers may be readily selected by one of skill in the art in view of the indication for which 
the transfer virus is directed. For example, one suitable carrier includes saline, which may be formulated with a variety 

40 of buffering solutions (e.g. , phosphate buffered saline). Other exemplary carriers include sterile saline, lactose, sucrose, 
calcium phosphate, gelatin, dextran, agar, pectin, peanut oil, sesame oil, and water. The selection of the carrier is not 
a limitation of the present invention. 

[0128] Optionally, the compositions of the invention may contain, in addition to the rAAV and carrier(s), other con- 
ventional pharmaceutical ingredients, such as preservatives, or chemical stabilizers. Suitable exemplary preservatives 

45 include chlorobutanol, potassium sorbate, sorbic acid, sulfur dioxide, propyl gallate, the parabens, ethyl vanillin, glyc- 
erin, phenol, and parachlorophenol. Suitable chemical stabilizers include gelatin and albumin. 
[0129] The viral vectors are administered in sufficient amounts to transfect the cells and to provide sufficient levels 
of gene transfer and expression to provide a therapeutic benefit without undue adverse effects, or with medically ac- 
ceptable physiological effects, which can be determined by those skilled in the medical arts. Conventional and phar- 

50 maceutically acceptable routes of administration include, but are not limited to, direct delivery to the selected organ 
(e.g., intraportal delivery to the liver), oral, inhalation (including intranasal and intratracheal delivery), intraocular, in- 
travenous, intramuscular, subcutaneous, intradermal, and other parental routes of administration. Routes of adminis- 
tration may be combined, if desired. 

[0130] Dosages of the viral vector will depend pnmarily on factors such as the condition being treated, the age, 
55 weight and health of the patient, and may thus vary among patients. For example, a therapeutically effective human 
dosage of the viral vector is generally in the range of from about 1 ml to about 1 00 ml of solution containing concen- 
trations of from about 1 x 1 0^ to 1 x 1 0"'^ genomes virus vector. A preferred human dosage may be about 1 x 1 0"^^ to 
1 X lO"'^ AAV genomes. The dosage will be adjusted to balance the therapeutic benefit against any side effects and 
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such dosages may vary depending upon the therapeutic application for which the recombinant vector is employed. 
The levels of expression of the transgene can be monitored to determine the frequency of dosage resulting in viral 
vectors, preferably AAV vectors containing the minlgene. Optionally, dosage regimens similar to those described for 
therapeutic purposes may be utilized for immunization using the compositions of the invention. 
5 [0131] Examples of therapeutic products and immunogenic products for delivery by the AAV-containing vectors of 
the invention are provided below. These vectors may be used for a variety of therapeutic or vaccinal regimens, as 
described herein. Additionally, these vectors may be delivered In combination with one or more other vectors or active 
Ingredients in a desired therapeutic and/or vaccinal regimen. 

10 B. Therapeutic Transgenes 

[0132] Useful therapeutic products encoded by the transgene Include hormones and growth and differentiation fac- 
tors including, without limitation, insulin, glucagon, growth hormone (GH), parathyroid hormone (PTH), growth hormone 
releasing factor (GRF), follicle stimulating hormone (FSH), luteinizing hormone (LH), human chorionic gonadotropin 

15 (hCG), vascular endothelial growth factor (VEGF), angiopoietins, angiostatin, granulocyte colony stimulating factor 
(GCSF), erythropoietin (EPO), connective tissue growth factor (CTGF), basic fibroblast growth factor (bFGF), acidic 
fibroblast growth factor (aFGF), epidermal growth factor (EGF), transforming growth factor a (TGFa), platelet-derived 
growth factor (PDGF), insulin growth factors I and II (IGF-I and IGF-11), any one of the transforming growth factor p 
superfamily, including TGF p, activins, inhibins, or any of the bone morphogenic proteins (BMP) BMPs 1-15, anyone 

20 of the heregluln/neuregulln/ARIA/neu differentiation factor (NDF) family of growth factors, nerve growth factor (NGF), 
braln-derlved neurotrophic factor (BDNF), neurotrophins NT-3 and NT-4/5, ciliary neurotrophic factor (GNTF), glial cell 
line derived neurotrophic factor (GDNF), neurturin, agrin, any one of the family of semaphorins/collapsins, netrln-1 and 
netrin-2, hepatocyte growth factor (HGF), ephrins, noggin, sonic hedgehog and tyrosine hydroxylase. 
[0133] Other useful transgene products includeproteinsthat regulate the immune system including, without limitation, 

25 cytokines and lymphoklnes such asthrombopoletin (TPO), Interleukins (IL) IL-1 through IL-25 (Including, IL-2, IL-4, IL- 
12, and IL-18), monocyte chemoattractant protein, leukemia inhibitory factor, granulocyte-macrophage colony stimu- 
lating factor. Fas ligand, tumor necrosis factors a and p, interferons a, p, and y, stem cell factor, flk-2/flt3 ligand. Gene 
products produced by the immune system are also useful in the invention. These include, without limitations, immu- 
noglobulins IgG, IgM, IgA, IgD and IgE, chimeric immunoglobulins, humanized antibodies, single chain antibodies, T 

30 cell receptors, chimeric T cell receptors, single chain T cell receptors, class I and class II IVIHC molecules, as well as 
engineered immunoglobulins and MHO molecules, Useful gene products also include complement regulatory proteins 
such as complement regulatory proteins, membrane cofactor protein (IVICP), decay accelerating factor (DAF), GR1 , 
GF2 and CD59. 

[0134] Still other useful gene products include any one of the receptors for the hormones, growth factors, cytokines, 
35 lymphoklnes, regulatory proteins and immune system proteins. The invention encompasses receptors for cholesterol 
regulation, including the low density lipoprotein (LDL) receptor, high density lipoprotein (HDL) receptor, the very low 
density lipoprotein (VLDL) receptor, and the scavenger receptor. The invention also encompasses gene products such 
as members of the steroid honnone receptor superfamily including glucocorticoid receptors and estrogen receptors. 
Vitamin D receptors and other nuclear receptors. In addition, useful gene products include transcription factors such 
40 as jun, fos, max, mad, serum response factor (SRF), AP-1 , AP2, myb, MyoD and myogenin, ETS-box containing pro- 
teins, TFE3, E2F, ATF1 , ATF2, ATF3, ATF4, ZF5, NFAT CREB, HNF-4, C/EBP, SP1 CCAAT-box binding proteins, 
interferon regulation factor (IRF-1), Wilms tumor protein, ETS-bindIng protein, STAT, GATA-box binding proteins, e.g., 
GATA-3, and the forkhead family of winged helix proteins. 

[0135] Other useful gene products include, carbamoyl synthetase 1, ornithine transcarbamylase, arginosuccinate 
45 synthetase, arginosuccinate lyase, arginase, fumarylacetacetate hydrolase, phenylalanine hydroxylase, alpha-1 anti- 
trypsin, glucose-6-phosphatase, porphobilinogen deaminase, factor VIII, factor IX, cystathlone beta-synthase, 
branched chain ketoacid decarboxylase, albumin, isovaleryl-coA dehydrogenase, propionyl CoA carboxylase, methyl 
malonyl CoA mutase, glutaryl CoA dehydrogenase, insulin, beta-glucosidase, pyruvate carboxylate, hepatic phospho- 
rylase, phosphorylase kinase, glycine decarboxylase, H-protein, T-protein, a cystic fibrosis transmembrane regulator 
50 (CFTR) sequence, and a dystrophin cDNA sequence. Still other useful gene products include enzymes such as may 
be useful In enzyme replacement therapy, which is useful In a variety of conditions resulting from deficient activity of 
enzyme. For example, enzymes that contain mannose-6-phosphate may be utilized in therapies for lysosomal storage 
diseases (e.g., a suitable gene includes that encoding p-glucuronidase (GUSB)). 

[0136] Other useful gene products Include non-naturally occun^lng polypeptides, such as chimeric or hybrid polypep- 
55 tides having a non-naturally occurring amino acid sequence containing Insertions, deletions or amino acid substitutions. 
For example, single-chain engineered immunoglobulins could be useful In certain Immunocompromised patients. Other 
types of non-naturally occurring gene sequences include antisense molecules and catalytic nucleic acids, such as 
ribozymes, which could be used to reduce overexpression of a target. 
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[0137] Reduction and/or modulation of expression of agene is particularly desirable for treatment of hyperproliferative 
conditions characterized by hyperproliferating cells, as are cancers and psoriasis. Target polypeptides include those 
polypeptides which are produced exclusively or at higher levels in hyperproliferative cells as compared to normal cells. 
Target antigens include polypeptides encoded by oncogenes such as myb, myc, fyn, and the translocation gene bcr/ 

5 abl, ras, src, P53, neu, trk and EGRF, in addition to oncogene products as target antigens, target polypeptides for anti- 
cancer treatments and protective regimens include variable regions of antibodies made by B cell lymphomas and 
variable regions of T cell receptors of T cell lymphomas which, in some embodiments, are also used as target antigens 
for autoimmune disease. Other tumor-associated polypeptides can be used as target polypeptides such as polypeptides 
which are found at higher levels in tumor cells including the polypeptide recognized by monoclonal antibody 1 7-1 A and 

10 folate binding polypeptides. 

[0138] Other suitable therapeutic polypeptides and proteins include those which may be useful for treating individuals 
suffering from autoimmune diseases and disorders by conferring a broad based protective immune response against 
targets that are associated with autoimmunity including cell receptors and cells which produce "self-directed antibod- 
ies. Tcell mediated autoimmune diseases include Rheumatoid arthritis (RA), multiple sclerosis (MS), Sjogren's syn- 

f5 drome, sarcoidosis, insulin dependent diabetes mellitus (IDDM), autoimmune thyroiditis, reactive arthritis^ ankylosing 
spondylitis, scleroderma, polymyositis, dermatomyositis, psoriasis, vasculitis, Wegener's granulomatosis, Crohn's dis- 
ease and ulcerative colitis. Each of these diseases is characterized by T cell receptors (TGRs) that bind to endogenous 
antigens and initiate the inflammatory cascade associated with autoimmune diseases. 

20 0. Immunogenic Transgenes 

[0139] Alternatively, or in addition, the vectors of the invention may contain AAV sequences of the invention and a 

transgene encoding a peptide, polypeptide or protein which induces an immune response to a selected immunogen. 
For example, immunogens may be selected from a variety of viral families. Example of desirable viral families against 

25 which an immune response would bedesirableinclude, the picornavirus family, which includes the genera rhinoviruses, 
which are responsibleforabout50% of cases of thecommon cold; the genera enteroviruses, which include polioviruses, 
coxsackieviruses, echovi ruses, and human enteroviruses such as hepatitis A virus; and the genera apthoviruses, which 
are responsible for foot and mouth diseases, primarily in non-human animals. Within the picornavirus family of viruses, 
target antigens include the VP1 , VP2, VPS, VP4, and VPG. Another viral family includes the calcivirus family, which 

30 encompasses the Norwalk group of viruses, which are an important causative agent of epidemic gastroenteritis. Still 
another viral family desirable for use in targeting antigens for inducing immune responses in humans and non-human 
animals is the togavirus family, which includes the genera alphavirus, which include Sindbis viruses, RossRiver virus, 
and Venezuelan, Eastern & Western Equine encephalitis, and rubivirus, including Rubella virus. The flaviviridae family 
includes dengue, yellow fever, Japanese encephalitis, St. Louis encephalitis and tick borne encephalitis viruses. Other 

35 target antigens may be generated from the Hepatitis 0 or the coronavirus family, which includes a number of non- 
human viruses such as infectious bronchitis virus (poultry), porcine transmissible gastroenteric virus (pig), porcine 
hemagglutinating encephalomyelitis virus (pig), feline infectious peritonitis virus (cats), feline enteric coronavirus (cat), 
canine coronavirus (dog), and human respiratory coronavi ruses, which may cause the common cold and/or non-A, B 
or 0 hepatitis. Within the coronavirus family, target antigens include the El (also called M or matrix protein), E2 (also 

40 called S or Spike protein), E3 (also called HE or hemagglutin-elterose) glycoprotein (not present in all coronaviruses), 
or N (nucleocapsid). Still other antigens may be targeted against the rhabdovirus family, which includes the genera 
vesiculovirus (e.g.. Vesicular Stomatitis Virus), and the general lyssavirus (e.g., rabies). Within the rhabdovirus family, 
suitable antigens maybe derived from the G protein orthe N protein. The family filoviridae, which includes hemorrhagic 
fever viruses such as Marburg and Ebola virus may be a suitable source of antigens. The paramyxovirus family includes 

45 parainfluenza Virus Type 1 , parainfluenza Virus Type 3, bovine parainfluenza Virus Type 3, rubulavirus (mumps virus, 
parainfluenza Virus Type 2, parainfluenza virus Type 4, Newcastle disease virus (chickens), rinderpest, morbillivirus, 
which includes measles and canine distemper, and pneumovirus, which includes respiratory syncytial virus. The influ- 
enza virus is classified within the family orthomyxovirus and is a suitable source of antigen (e.g., the HA protein, the 
N1 protein). The bunyavirus family includes the genera bunyavirus (California encephalitis. La Crosse), phlebovirus 

50 (Rift Valley Fever), hantavirus (puremala is a hemahagin fever virus), nairovirus (Nairobi sheep disease) and various 
unassigned bungaviruses. The arenavirus family provides a source of antigens against LCM and Lassa fever virus. 
The reovirus family includes the genera reovirus, rotavirus (which causes acute gastroenteritis in children), orbiviruses, 
and cultivirus (Colorado Tick fever, Lebombo (humans), equine encephalosis, blue tongue). 

[0140] The retrovirus family includes the sub-family oncorivirinal which encompasses such human and veterinary 
55 diseases as feline leukemia virus, HTLVI and HTLVII, lentivirinal (which includes human immunodeficiency virus (HIV), 
simian immunodeficiency virus (SIV), feline immunodeficiency virus (FIV), equine infectious anemia virus, andspuma- 
virinal). Between the H IV and SIV, many suitable antigens have been described and can readily be selected. Examples 
of suitable HIV and SIV antigens include, without limitation the gag, pol, Vif, Vpx, VPR, Env, Tat and Rev proteins, as 
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well as various fragments thereof. In addition, a variety of modifications to these antigens have been described, Suitable 
antigens for this purpose are known to those of skill in the art. For example, one may select a sequence encoding the 
gag, pol, Vif, and Vpr, Env, Tat and Rev, amongst other proteins. See, e.g., the modified gag protein which is described 
in US Patent 5,972,596. See, also, the HIV and SIV proteins described in D.H. Barouch et al, J. Virol., 75(5):2462-2467 
5 (March 2001), and R.R. Amara, et al, Science, 292:69-74 (6 April 2001). These proteins or subunits thereof may be 
delivered alone, or in combination via separate vectors or from a single vector 

[0141] The papovavirus family includes the sub-family polyomaviruses (BKU and JCU viruses) and the sub-family 
papillomavirus (associated with cancers or malignant progression of papilloma). The adenovirus family includes viruses 
(EX, AD7, ARD, O.B.) which cause respiratory disease and/or enteritis. The parvovirus family feline parvovirus (feline 

10 enteritis), feline panleucopeniavirus, canine parvovirus, and porcine parvovirus. The herpesvirus family includes the 
sub-family alphaherpesvirinae, which encompasses the genera simplexvirus (HSVI, HSVII), varicellovirus (pseudora- 
bies, varicella zoster) and the sub-family betaherpesvirinae, which includes the genera cytomegalovirus (HCMV, 
muromegalovirus) and the sub-family gammaherpesvirinae, which includes the genera lymphocryptovirus, EBV 
(Burkitts lymphoma), infectious rhinotracheitis, Marek's disease virus, and rhadinovirus. The poxvirus family includes 

15 the sub-family chordopoxvirinae, which encompasses the genera orthopoxvirus (Variola (Smallpox) and Vaccinia (Cow- 
pox)), parapoxvirus, avipoxvirus, capri poxvirus, lepori poxvirus, suipoxvirus, and the sub-family entomopoxvirinae. The 
hepadnavirus family includes the Hepatitis B virus. One unclassified virus which may be suitable source of antigens 
is the Hepatitis delta virus. Still other viral sources may include avian infectious bursal disease virus and porcine res- 
piratory and reproductive syndrome virus. The alphavirus family includes equine arteritis virus and various Encephalitis 

20 viruses. 

[0142] The present invention may also encompass immunogens which are useful to immunize a human or non- 
human animal against other pathogens including bacteria, fungi, parasitic microorganisms or multicellular parasites 
which infect human and non-human vertebrates, or from a cancer cell or tumor cell. Examples of bacterial pathogens 
include pathogenic gram-positive cocci include pneumococci; staphylococci; and streptococci. Pathogenic gram-neg- 

25 ative cocci include meningococcus; gonococcus. Pathogenic enteric gram-negative bacilli include enterobacteriaceae; 
pseudomonas, acinetobacteria and eikenella; melioidosis; salmonella; shigella; haemophilus; moraxella; H. ducreyi 
(which causes chancroid); brucella; Franisella tularensis (\Nh'\ch causes tularemia); yersinia (pasteurella); streptoba- 
cillus moniliformis and spirillum; Gram-positive bacilli include listeria monocytogenes; erysipelothrix rhusiopathiae; Co- 
rynebacterium diphtheria (diphtheria); cholera; B. anthracis (anthrax); donovanosis (granuloma inguinale); and bar- 

30 tonellosis. Diseases caused by pathogenic anaerobic bacteria include tetanus; botulism; other Clostridia; tuberculosis; 
leprosy; and other mycobacteria. Pathogenic spirochetal diseases include syphilis; treponematoses: yaws, pinta and 
endemic syphilis; and leptospirosis. Other infections caused by higher pathogen bacteria and pathogenic fungi include 
actinomycosis; nocardiosis; cryptococcosis, blastomycosis, histoplasmosis and coccidioidomycosis; candidiasis, as- 
pergillosis, and mucormycosis; sporotrichosis; paracoccidiodomycosis, petriellidiosis, torulopsosis, mycetoma and 

35 chromomycosis; and dermatophytosis. Rickettsial infections include Typhus fever, Rocky Mountain spotted fever, Q 
fever, and Rickettsialpox. Examples of mycoplasma and chlamydial infections include: mycoplasma pneumoniae; lym- 
phogranuloma venereum; psittacosis; and perinatal chlamydial infections. Pathogenic eukaryotes encompass patho- 
genic protozoans and helminths and infections produced thereby include: amebiasis; malaria; leishmaniasis; trypano- 
somiasis; toxoplasmosis; Pneumocystis carinii; Trichans; Toxoplasma gondii; babesiosis; giardiasis; trichinosis; filaria- 

40 sis; schistosomiasis; nematodes; trematodes or flukes; and cestode (tapeworm) infections. 

[0143] Many of these organisms and/or toxins produced thereby have been identified by the Centers for Disease 
Control [(CDC), Department of Heath and Human Services, USA], as agents which have potential for use in biological 
attacks. For example, some of these biological agents, include, Baciilus anthracis (anthrax), Clostridium botulinum and 
its toxin (botulism). Yersinia pestis (plague), variola major (smallpox), Franciselia tularensis (tularemia), and viral he- 

45 morrhagic fever, all of which are currently classified as Category A agents; Coxiella burnetii (Q fever); Brucella species 
(brucellosis), Burlcholderia ma//e/ (glanders), Ricinus communis and its toxin (ricin toxin), Clostridium perfringens and 
its toxin (epsilon toxin). Staphylococcus species and their toxins (enterotoxin B), all of which are currently classified 
as Category B agents; and Nipan virus and hantaviruses, which are currently classified as Category C agents. In 
addition, other organisms, which are so classified or differently classified, may be identified and/or used for such a 

50 purpose in the future. It will be readily understood that the viral vectors and other constructs described herein are useful 
to deliver antigens from these organisms, viruses, their toxins or other byproducts, which will prevent and/or treat 
infection or other adverse reactions with these biological agents. 

[0144] Administration of the vectors of the invention to deliver immunogens against the variable region of the T cells 
elicit an immune response including CTLs to eliminate those T cells. In rheumatoid arthritis (RA), several specific 
55 variable regions of T cell receptors (TCRs) which are involved in the disease have been characterized. These TCRs 
include V-3, V-14, V-17 and Va-17. Thus, delivery of a nucleic acid sequence that encodes at least one of these 
polypeptides will elicit an immune response that will target T cells involved in RA. In multiple sclerosis (MS), several 
specific variable regions of TCRs which are involved in the disease have been characterized. These TCRs include V- 
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7 and Va-10. Thus, delivery of a nucleic acid sequence that encodes at least one of these polypeptides will elicit an 
immune response that will target T cells involved in MS. In scleroderma, several specific variable regions of TCRs 
which are involved in the disease have been characterized. These TCRs include V-6, V-8, V-14 and Va-16, Va-3C, 
Va-7, Va-14, Va-15, Va-16, Va-28 and Va-12. Thus, delivery of a nucleic acid molecule that encodes at least one of 
5 these polypeptides will elicit an immune response that will target T cells involved in scleroderma. 

[0145] Optionally, vectors containing AAV sequences of the invention may be delivered using a prime-boost regimen. 
A variety of such regimens have been described in the art and may be readily selected. See, e.g., WO 00/11140, 
published March 2, 2000, incorporated by reference. 

[0146] Such prime-boost regimens typically involve the administration of a DNA (e.g., plasmid) based vector to prime 
10 the immune system to second, booster, administration with a traditional antigen, such as a protein or a recombinant 
virus carrying the sequences encoding such an antigen. In one embodiment, the invention provides a method of priming 
and boosting an immune response to a selected antigen by delivering a plasmid DNA vector carrying said antigen, 
followed by boosting, e.g., with a vector containing AAV sequences of the invention, 

[0147] In one embodiment, the prime-boost regimen involves the expression of multiproteins from the prime and/or 
f5 the boost vehicle. See, e.g., R.R. Amara, Science, 292:69-74 (6 April 2001) which describes a multiprotein regimen 
for expression of protein subunits useful for generating an immune response against HIV and SIV. For example, a DNA 
prime may deliver the Gag, Pol, Vif, VPX and Vpr and Env, Tat, and Rev from a single transcript. Alternatively, the SIV 
Gag, Pol and HIV-1 Env is delivered. 

[0148] However, the prime-boost regimens are not limited to immunization for HIV or to delivery of these antigens. 

20 For example, priming may involve delivering with a first chimp vector of the invention followed by boosting with a second 
chimp vector, or with a composition containing the antigen itself in protein form. In one or example, the prime-boost 
regimen can provide a protective immune response to the virus, bacteria or other organism from which the antigen is 
derived. In another desired embodiment, the prime-boost regimen provides a therapeutic effect that can be measured 
using convention assays for detection of the presence of the condition for which therapy is being administered. 

25 [0149] The priming vaccine may be administered at various sites in the body in a dose dependent manner, which 
depends on the antigen to which the desired immune response is being targeted. The invention is not limited to the 
amount or situs of injection(s) or to the pharmaceutical carrier. Rather, the priming step encompasses treatment reg- 
imens which include a single dose or dosage which is administered hourly, daily, weekly or monthly, or yearly. As an 
example, the mammals may receive one or two priming injection containing between about 10 ^g to about 50 ^g of 

30 plasmid in carrier. A desirable priming amount or dosage of the priming DNA vaccine composition ranges between 
about 1 jxg to about 10,000 ja,g of the DNA vaccine. Dosages may vary from about 1 jig to 1000 |ag DNA per kg of 
subject body weight. The amount or site of injection is desirably selected based upon the identity and condition of the 
mammal being vaccinated. 

[0150] The dosage unit of the DNA vaccine suitable for delivery of the antigen to the mammal is described herein. 

35 The DNA vaccine is prepared for administration by being suspended or dissolved in a pharmaceutically or physiolog- 
ically acceptable carrier such as isotonic saline, isotonic salts solution or other formulations which will be apparent to 
those skilled in such administration. The appropriate carrier will be evident to those skilled in the art and will depend 
in large part upon the route of administration. The compositions of the invention may be administered to a mammal 
according to the routes described above, in a sustained release formulation using a biodegradable biocompatible 

40 polymer, or by on-site delivery using micelles, gels and liposomes, 

[0151] Optionally, the priming step of this invention also includes administering with the priming DNA vaccine com- 
position, a suitable amount of an adjuvant, such as are defined herein. 

[0152] Preferably, a boosting composition is administered about 2 to about 27 weeks after administering the priming 
DNA vaccine to the mammalian subject. The administration of the boosting composition is accomplished using an 

45 effective amount of a boosting vaccine composition containing or capable of delivering the same antigen as adminis- 
tered by the priming DNA vaccine. The boosting composition may be composed of a recombinant viral vector derived 
from the same viral source or from another source. Alternatively, the "boosting composition" can be a composition 
containing the same antigen as encoded in the priming DNA vaccine, but in the form of a protein or peptide, which 
composition induces an immune response in the host. In another embodiment, the boosting vaccine composition in- 

50 eludes a composition containing a DNA sequence encoding the antigen under the control of a regulatory sequence 
directing its expression in a mammalian cell, e.g., vectors such as well-known bacterial or viral vectors. The primary 
requirements of the boosting vaccine composition are that the antigen of the vaccine composition is the same antigen, 
or a cross-reactive antigen, as that encoded by the DNA vaccine. 

[0153] Suitably, the vectors of the invention are also well suited for use in regimens which use non-AAV vectors as 
55 well as proteins, peptides, and/or other biologically useful therapeutic or immunogenic compounds. These regimens 
are particularly well suited to gene delivery for therapeutic poses and for immunization, including inducing protective 

immunity. Such uses will be readily apparent to one of skill in the art. 

[0154] Further, a vector of the invention provides an efficient gene transfer vehicle which can deliver a selected 
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transgene to a selected host cell In vivo or ex vivo e\/ en where the organism has neutralizing antibodies to one or more 
AAV serotypes. In one embodiment, the vector (e.g., an rAAV) and the cells are mixed ex vivo; the infected cells are 
cultured using conventional methodologies; and the transduced cells are re-infused into the patient. Further, the vectors 
of the invention may also be used for production of a desired gene product in vitro. For in vitro production, a desired 
5 product (e.g., a protein) may be obtained from a desired culture following transfection of host cells with a rAAV containing 
the molecule encoding the desired product and culturing the cell culture under conditions which permit expression. 
The expressed product may then be purified and isolated, as desired. Suitable techniques for transfection, cell culturing, 
purification, and isolation are known to those of sl<ill in the art. 

[0155] The following examples illustrate several aspects and embodiments of the invention. 

10 

EXAMPLES 

Example 1 : PGR amplification, cloning and characterization of novel AAV sequences. 

15 [0156] Tissues from non human primates were screened for AAV sequences using a PGR method based on oligo- 
nucleotides to highly conserved regions of l^nown AAVs. A stretch of AAV sequence spanning 2886 to 3143 bp of AAV1 
[SEQ ID NO:6] was selected as a PGR amplicon in which a hypervariable region of the capsid protein (Gap) that is 
unique to each known AAV serotype, which is termed herein a "signature region," is flanked by conserved sequences. 
In later analysis, this signature region was shown to be located between conserved residues spanning hypervariable 

20 region 3. 

[0157] An initial survey of peripheral blood of a number of nonhuman primate species revealed detectable AAV in a 
subset of animals from species such as rhesus macaques, cynomologous macaques, chimpanzees and baboons. 
However, there were no AAV sequences detected in some other species tested, including Japanese macaques, pig- 
tailed macaques and squirrel monkeys. A more extensive analysis of vector distribution was conducted in tissues of 
25 rhesus monkeys of the University of Pennsylvania and Tulane colonies recovered at necropsy. This revealed AAV 
sequence throughout a wide array of tissues. 

A. Ampiiffcation of an AAV signature region 

30 [0158] DN A sequences of AAV1 -6 and AAVs isolated from Goose and Duck were aligned to each other using "Glustal 
W" at default settings. The alignment for AAV1-6, and including the information for the novel AAV7, is provided in Fig. 
1 . Sequence similarities among AAVs were compared. 

[0159] In the line of study, a 257 bp region spanning 2886 bp to 3143 bp of AAV 1 [SEQ ID NO: 6], and the corre- 
sponding region in the genomes of AAV 2-6 genomes [See, Fig. 1], was identified by the inventors. This region is 

35 located with the AAV capsid gene and has highly conserved sequences among at both 5' and 3' ends and is relatively 
variable sequence in the middle. In addition, this region contains a Dralll restriction enzyme site (GAGCAGGTG, SEQ 
ID NO:15) . The inventors have found that this region serves as specific signature for each known type of AAV DNA. 
In other words, following PGR reactions, digestion with endonucleases that are specific to each known serotypes and 
gel electrophoresis analysis, this regions can be used to definitively identify amplified DNA as being from serotype 1 , 

40 2, 3, 4, 5, 6, or another serotype. 

[0160] The primers were designed, validated and PGR conditions optimized with AAV1 , 2 and 5 DNA controls, The 
primers were based upon the sequences of AAV2: 5' primer, 1 S: bp 2867-2891 of AAV2 (SEQ ID N0:7) and 3' primer, 
18as, bp 3095-3121 of AAV2 (SEQ ID NO:7). 

[0161] Gellular DNAs from different tissues including blood, brain, liver, lung, testis, etc. of different rhesus monkeys 
45 were studied utilizing the strategy described above. The results revealed that DNAs from different tissues of these 
monkeys gave rise to strong PGR amplifications. Further restriction analyses of PGR products indicated that they were 
amplified from AAV sequences different from any published AAV sequences. 

[0162] PGR products (about 255 bp in size) from DNAs of a variety of monkey tissues have been cloned and se- 
quenced. Bioinformatics study of these novel AAV sequences indicated that they are novel AAV sequences of capsid 
50 gene and distinct from each other. Fig. 1 includes in the alignment the novel AAV signature regions for AAV1 0-12 [SEQ 
ID N0:117, 118 and 119, respectively]. IVIultiple sequence alignment analysis was performed using the Glustal W (1 .81) 
program. The percentage of sequence identity between the signature regions of AAV 1-7 and AAV 10-12 genomes is 
provided below. 

55 
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Table 1 . 



Sequences for Analysis 


Sequence 


AAV 


Size 


# 


Serotype 


(bp) 








1 


AAV1 


258 


2 


AAV2 


255 


3 


AAV3 


255 


4 


AAV4 


246 


5 


AAV5 


258 


6 


AAV6 


258 


7 


AAV7 


258 


10 


AAV10 


255 


11 


AAV11 


258 


12 


AAV12 


255 



Table 3. 



30 



35 



40 



Palrwise Alignment (Percentage of Identity) 




AAV2 


AAV3 


AAV4 


AAV5 


AAV6 


AAV7 


AAV10 


AAV11 


AAV12 


AAV1 


90 


90 


81 


76 


97 


91 


93 


94 


93 


AAV2 




93 


79 


78 


90 


90 


93 


93 


92 


AAV3 






80 


76 


90 


92 


92 


92 


92 


AAV4 








76 


81 


84 


82 


81 


79 


AAV5 










75 


78 


79 


79 


76 


AAV6 












91 


92 


94 


94 


AAV7 














94 


92 


92 


AAV10 
















95 


93 


AAV11 


















94 



[0163] Over 300 clones containing novel AAV serotype sequences that span the selected 257 bp region were Isolated 
and sequenced. Bioinformatics analysis of these 300+ clones suggests that this 257 bp region is critical in serving as 
a good land marker or signature sequence for quick isolation and Identification of novel AAV serotype. 



B, Use of the signature region for PCR amplification. 



[0164] The 257 bp signature region was used as a PCR anchor to extend PCR amplifications to 5' of the genome to 
cover the junction region of rep and cap genes (1398 bp - 3143 bp, SEQ ID NO:6) and 3' of the genome to obtain the 
entire cap gene sequence (2866 bp - 4600 bp, SEQ ID NO:6). PCR amplifications were carried out using the standard 
conditions, including denaturing at 95°C for 0.5-1 min, annealing at 60-65°C for 0.5-1 min and extension at 72° C for 
1 min per kb with a total number of amplification cycles ranging from 28 to 42. 

[0165] Using the aligned sequences as described in "A", two other relative conserved regions were identified in the 
sequence located in 3' end of rep genes and 5' to the 257 bp region and in the sequence down stream of the 257 bp 
fragment but before the AAV 3 ITR. Two sets of new primers were designed and PCR conditions optimized for recovery 
of entire capsid and a part of rep sequences of novel AAV serotypes. More specifically, for the 5' amplification, the 5' 
primer, AVI Ns, was GCTGCGTCAACTGGACCAATGAGAAC [nt 1 398-1423 of AAV1 , SEQ ID NO:6] and the 3' primer 
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was 18as, identified above. For the 3' amplification, the 5' prinner was 1s, identified above, and the 3' prinner was 
AV2Las, TCGTTTCAGTTGAACTTTGGTCTCTGCG [nt 4435-4462 of AAV2, SEQ ID NO:7]. 

[0166] In these PGR amplifications, the 257 bp region was used as a PGR anchor and land marker to generate 
overlapping fragments to construct a complete capsid gene by fusion at the Dralll site in the signature region following 
5 amplification of the 5' and 3' extension fragments obtained as described herein. More particularly, to generate the intact 
AAV7 cap gene, the three amplification products (a) the sequences of the signature region; (b) the sequences of the 
5' extension; and (c) the sequences of the 3' extension were cloned into a pGR4-Topo [Invitrogen] plasmid backbone 
according to manufacturer's instructions. Thereafter, the plasmids were digested with Dralll and recombined to form 
an intact cap gene. 

10 [0167] In this line of worl<, about 80 % of capsid sequences of AAV7 and AAV 8 were isolated and analyzed. Another 
novel serotype, AAV9, was also discovered from Monkey #2. 

[0168] Using the PGR conditions described above, the remaining portion of the rep gene sequence for AAV7 is 

isolated and cloned using the primers that amplify 108 bp to 1461 bp of AAV genome (calculated based on the num- 
bering of AAV2, SEQ ID NO:7). This clone is sequenced for construction of a complete AAV7 genome without ITRs. 

15 

G. Direct Amplification of 3.1 kb Gap fragment 

[0169] To directly amplify a 3.1 kb full-length Gap fragment from NHP tissue and blood DNAs, two other highly con- 
served regions were identified in AAV genomes for use in PGR amplification of large fragments. A primer within a 

20 conserved region located in the middle of the rep gene was selected (AVI ns: 5' GGTGGGTGAAGTGGAGGAATGA- 
GAAC 3', nt 1 398-1423 of SEQ ID NO:6) in combination with the 3' primer located in another conserved region down- 
stream of the Gap gene (AV2cas: 5' GGGAGAGAGGAAAGTTGAAGTGAAAGGA 3', SEQ ID NO:7) for amplification of 
full-length cap fragments. The PGR products were Topo-cloned according to manufacturer's directions (Invitrogen) and 
sequence analysis was performed by Qiagengenomics (Qiagengenomics, Seattle, WA) with an accuracy of > 99.9%. 

25 A total of 50 capsid clones were isolated and characterized. Among them, 37 clones were derived from Rhesus 
macaque tissues (rh.1 - rh.37), 6 clones from cynomologous macaques (cy.1 - cy.6), 2 clones from Baboons (bb.1 and 
bb.2) and 5 clones from Chimps (ch.1 - ch.5). 

[0170] To rule out the possibility that sequence diversity within the novel AAV family was not an artifact of the PGR, 
such as PGR-mediated gene splicing by overlap extension between different partial DNA templates with homologous 

30 sequences, or the result of recombination process in bacteria, a series of experiments were perfomned under identical 
conditions for VP1 amplification using total cellular DNAs. First, intact AAV7 and AAV8 plasmids were mixed at an 
equal molar ratio followed by serial dilutions. The serially diluted mixtures were used as templates for PGR amplification 
of 3.1 kb VP1 fragments using universal primers and identical PGR conditions to that were used for DNA amplifications 
to see whether any hybrid PGR products were generated. The mixture was transformed into bacteria and isolated 

35 transformants to look for hybrid clones possibly derived from recombination process in bacterial cells. In a different 
experiment, we restricted AAV7 and AAV8 plasmids with Msp I, Ava I and Hael, all of which cut both genomes multiple 
times at different positions, mixed the digestions in different combinations and used them for PGR amplification of VP1 
fragments under the same conditions to test whether any PGR products could be generated through overlap sequence 
extension of partial AAV sequences. In another experiment, a mixture of gel purified 5' 1 .5 kb AAV7 VP1 fragment and 

40 3' 1 .7 kb AAV8 VP1 fragment with overlap in the signature region was serially diluted and used for PGR amplification 
in the presence and absence of 200 ng cellular DNA extracted from a monkey cell line that was free of AAV sequences 
by TaqMan analysis. None of these experiments demonstrated efficient PGR-mediated overlap sequence production 
under the conditions of the genomic DNA Cap amplification (data not shown). As a further confirmation, 3 pairs of 
primers were designed, which were located at different HVRs, and were sequence specific to the variants of clone 42s 

45 from Rhesus macaque F953, in different combinations to amplify shorter fragments from mesenteric lymph node (MLN) 
DNA from F953 from which clone 42s were isolated. All sequence variations identified in full-length Gap clones were 
found in these short fragments (data not shown). 

Example 2: Aden o- Associated Viruses Undergo Substantial Evolution in Primates During Natural Infections 

50 

[0171] Sequence analysis of selected AAV isolates revealed divergence throughout the genome that is most con- 
centrated in hypervariable regions of the capsid proteins. Epidemiologic data indicate that all known serotypes are 
endemic to primates, although isolation of clinical isolates has been restricted to AAV2 and AAV3 from anal and throat 
swabs of human infants and AAV5 from a human condylomatous wart. No known clinical sequalae have been asso- 
55 ciated with AAV infection. 

[0172] In an attempt to better understand the biology of AAV, nonhuman primates were used as models to charac- 
terize the sequlae of natural infections. Tissues from nonhuman primates were screened for AAV sequences using the 
PGR method of the invention based on oligonucleotides to highly conserved regions of known AAVs (see Example 1). 
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A stretch of AAV sequence spanning 2886 to 3143 bp of AAV1 [SEQ ID N0:6] was selected as a PGR annplicon in 
which conserved sequences are flanked by a hypervariable region that is unique to each known AAV serotype, termed 
herein a "signature region." 

[0173] An initial survey of peripheral blood of a number of nonhuman primate species including rhesus monkeys, 

5 cynomologous monkeys, chimpanzees, and baboons revealed detectable AAV in a subset of animals from all species. 
A more extensive analysis of vector distribution was conducted in tissues of rhesus monkeys of the University of Penn- 
sylvania and Tulane colonies recovered at necropsy. This revealed AAV sequence throughout a wide array of tissues. 
[0174] The amplified signature sequences were subcloned into plasmids and individual transformants were subjected 
to sequence analysis. This revealed substantial variation in nucleotide sequence of clones derived from different ani- 

10 mals. Variation in the signature sequence was also noted in clones obtained within individual animals. Tissues har- 
vested from two animals in which unique signature sequences were identified (i.e., colon from 98E044 and heart from 
98E056) were further characterized by expanding the sequence amplified by PGR using oligonucleotides to highly 
conserved sequences. In this way, complete proviral structures were reconstructed for viral genomes from both tissues 
as described herein. These proviruses differ from the other known AAVs with the greatest sequence divergence noted 

15 in regions of the Gap gene. 

[0175] Additional experiments were performed to confirm that AAV sequences resident to the nonhuman primate 
tissue represented proviral genomes of infectious virus that is capable of being rescued and fomri virions. Genomic 
DNA from liver tissue of animal 98E056, from which AAVS signature sequence was detected, was digested with an 
endonuclease that does not have a site within the AAV sequence and transfected into 293 cells with a plasmid containing 

20 an E1 deleted genome of human adenovirus serotype 5 as a source of helper functions. The resulting lysate was 
passaged on 293 cells once and the lysate was recovered and analyzed for the presence of AAV Cap proteins using 
a broadly reacting polyclonal antibody to Cap proteins and for the presence and abundance of DNA sequences from 
the PGR amplified AAV provirus from which AAVS was derived. Transfection of endonuclease restricted heart DNA 
and the adenovirus helper plasmid yielded high quantities of AAVS virus as demonstrated by the detection of Gap 

25 proteins by Western blot analysis and the presence of 1 0^ AAVS vector genomes per 293 cell. Lysates were generated 
from a large-scale preparation and the AAV was purified by cesium sedimentation. The purified preparation demon- 
strated 26 nm icosohedral structures that look identical to those of AAV serotype 2. Transfection with the adenovirus 
helper alone did not yield AAV proteins or genomes, ruling out contamination as a source of the rescued AAV. 
[0176] To further characterize the inter and intra animal variation of AAV signature sequence, selected tissues were 

30 subjected to extended PGR to amplify entire Gap open reading frames. 

[0177] The resulting fragments were cloned into bacterial plasmids and individual transformants were isolated and 
fully sequenced. This analysis involved mesenteric lymph nodes from three rhesus monkeys (Tulane/V223 - 6 clones; 
Tulane/T612 - 7 clones; Tulane/F953 - 14 clones), liver from two rhesus monkeys (Tulane/V251 - 3 clones; Penn/ 
00E033 - 3 clones), spleen from one rhesus monkey (Penn/97E043 - 3 clones), heart from one rhesus monkey (IHGT/ 

35 9SE046- 1 clone) and peripheral blood from one chimpanzee (New lberia/X133 - 5 clones), six cynomologous 
macaques (Charles River/A1378, A3099, A338S, A3442, A2821 , A3242 - 6 clones total) and one Baboon (SFRB/8644 
- 2 clones). Of the 50 clones that were sequenced from 1 5 different animals, 30 were considered non-redundant based 
on the finding of at least 7 amino acid differences from one another. The non-redundant VP1 clones are numbered 
sequentially as they were isolated, with a prefix indicating the species of non-human primate from which they were 

40 derived. The structural relationships between these 30 non-redundant clones and the previously described 8 AAV 
serotypes were determined using the SplitsTree program [Huson, D. H. SplitsTree: analyzing and visualizing evolu- 
tionary data. BioinformaticsAA, 68-73 (1 998)] with implementation of the method of split decomposition. The analysis 
depicts homoplasy between a set of sequences in a tree-like network rather than a bifurcating tree. The advantage is 
to enable detection of groupings that are the result of convergence and to exhibit phylogenetic relationships even when 

45 they are distorted by parallel events. Extensive phylogenetic research will be required in order to elucidate the AAV 
evolution, whereas the intention here only is to group the different clones as to their sequence similarity. 
[0178] To confirm that the novel VP1 sequences were derived from infectious viral genomes, cellular DNA from 
tissues with high abundance of viral DNA was restricted with an endonuclease that should not cleave within AAV and 
transfected into 293 cells, followed by infection with adenovirus. This resulted in rescue and amplification of AAV ge- 

50 nomes from DNA of tissues from two different animals (data not shown). 

[0179] VP1 sequences of the novel AAVs were further characterized with respect to the nature and location of amino 
acid sequence variation. All 30 VP1 clones that were shown to differ from one another by greater than 1% amino acid 
sequence were aligned and scored for variation at each residue. An algorithm developed to determine areas of se- 
quence divergence yielded 12 hypervariable regions (HVR) of which 5 overlap or are part of the 4 previously described 

55 variable regions [Kotin, cited above; Rutledge, cited above]. The three-fold-proximal peaks contain most of the varia- 
bility (HVR5-10). Interestingly the loops located at the 2 and 5 fold axis show intense variation as well. The HVRs 1 
and 2 occur in the N-terminal portion of the capsid protein that is not resolved in the X-ray structure suggesting that 
the N-terminus of the VP1 protein is exposed on the surface of the virion. 
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[0180] Real-time PGR was used to quantify AAV sequences fronn tissues of 21 rfiesus monkeys using primers and 
probes to highly conserved regions of Rep (one set) and Cap (two sets) of known AAVs. Each data point represents 
analysis from tissue DNA from an individual animal. This confirmed the wide distribution of AAV sequences, although 
the quantitative distribution differed between individual animals. The source of animals and previous history ortreat- 

5 ments did not appear to influence distribution of AAV sequences in rhesus macaques. The three different sets of primers 
and probes used to quantify AAV yielded consistent results. The highest levels of AAV were found consistently in 
mesenteric lymph nodes at an average of 0.01 copies per diploid genome for 13 animals that were positive. Liver and 
spleen also contained high abundance of virus DNA. There were examples of very high AAV, such as in heart of rhesus 
macaque 98E056, spleen of rhesus macaque 97E043 and liver of rhesus macaque RQ4407, which demonstrated 1 .5, 

10 3 and 20 copies of AAV sequence per diploid genome respectively. Relatively low levels of virus DNA were noted in 
peripheral blood mononuclear cells, suggesting the data in tissue are not due to resident blood components (data not 
shown). It should be noted that this method would not necessarily capture all AAVs resident to the non human primates 
since detection requires high homology to both the oligonucleotides and the real time PGR probe. Tissues from animals 
with high abundance AAV DNA was further analyzed for the molecular state of the DNA, by DNA hybridization tech- 

15 niques, and its cellular distribution, by in situ hybridization. 

[0181] The kind of sequence variation revealed in AAV proviral fragments isolated from different animals and within 
tissues of the same animals is reminiscent of the evolution that occurs for many RNA viruses during pandemics or 
even within the infection of an individual, In some situations the notion of a wild-type virus has been replaced by the 
existence of swarms of quasispecies that evolve as a result of rapid replication and mutations in the presence of 

20 selective pressure. One example is infection by HIV, which evolves in response to immunologic and pharmacologic 
pressure. Several mechanisms contribute to the high rate of mutations in RNA viruses, including low fidelity and lack 
of proof reading capacity of reverse transcriptase and non-homologous and homologous recombination. 
[0182] Evidence for the formation of quasispecies of AAV was illustrated in this study by the systematic sequencing 
of multiple cloned proviral fragments. In fact, identical sequences could not be found within any extended clones isolated 

25 between or within animals. An important mechanism for this evolution of sequence appears to be a high rate of ho- 
mologous recombination between a more limited number of parenteral viruses. The net result is extensive swapping 
of hypervariable regions of the Gap protein leading to an array of chimeras that could have different tropisms and 
serologic specificities (i.e., the ability to escape immunologic responses especially as it relates to neutralizing antibod- 
ies). Mechanisms by which homologous recombination could occur are unclear. One possibility is that + and - strands 

30 of different single stranded AAV genomes anneal during replication as has been described during high multiplicity of 
infections with AAV recombinants. It is unclear if other mechanisms contribute to sequence evolution in AAV infections. 
The overall rate of mutation that occurs during AAV replication appears to be relatively low and the data do not suggest 
high frequencies of replication errors. However, substantial rearrangements of the AAV genome have been described 
during lytic infection leading to the formation of defective interfering particles. Irrespective of the mechanisms that lead 

35 to sequence divergence, with few exceptions , vpl structures of the quasispecies remained intact without frameshifts 
or nonsense mutations suggesting that competitive selection of viruses with the most favorable profile of fitness con- 
tribute to the population dynamics. 

[0183] These studies have implications in several areas of biology and medicine. The concept of rapid virus evolution, 

formerly thought to be a property restricted to RNA viruses, should be considered in DNA viruses, which classically 
40 have been characterized by serologic assays. It will be important in terms of parvoviruses to develop a new method 
for describing virus isolates that captures the complexity of its structure and biology, such as with HIV, which are 
categorized as general families of similar structure and function called Glades. An alternative strategy is to continue 
to categorize isolates with respect to serologic specificity and develop criteria for describing variants within serologic 
groups. 

45 

Example 3: Vectorology of recombinant AAV genomes equipped with AAV2 ITRs using chimeric plasmids containing 
AAV2 rep and novel AAV cap genes for serological and gene transfer studies in different animal models. 

[0184] Chimeric packaging constructs are generated by fusing AAV2 rep with cap sequences of novel AAV serotypes. 
50 These chimeric packaging constructs are used, initially, for pseudotyping recombinant AAV genomes carrying AAV2 
ITRs by triple transfection in 293 cell using Ad5 helper plasmid. These pseudotyped vectors are used to evaluate 
performance in transduction-based serological studies and evaluate gene transfer efficiency of novel AAV serotypes 
in different animal models including NHP and rodents, before intact and infectious viruses of these novel serotypes 
are isolated. 

55 

A. pAAV2GFP 

[0185] The AAV2 plasmid which contains the AAV2 ITRs and green fluorescent protein expressed under the control 
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of a constitutitive promoter. This plasmid contains the following elennents: the AAV2 ITRs, a CIVIV promoter, and the 
GFP coding sequences. 

B. Cloning of trans p\asm\d 

5 

[0186] To construct the chimeric trans-plasmid for production of recombinant pseudotyped AAV7 vectors, p5E18 
plasmid (Xiao etaL, 1999, J. V/ro/ 73:3994-4003) was partially digested with Xho I to linearize the plasmid at the Xho 
I site at the position of 3169 bp only. The Xho I cut ends were then filled in and ligated back. This modified p5E18 
plasmid was restricted with Xba I and Xho I in a complete digestion to remove the AAV2 cap gene sequence and 
10 replaced with a 2267 bp Spe l/Xho I fragment containing the AAV7 cap gene which was isolated from pCRAAV7 
6-5+15-4 plasmid. 

[0187] The resulting plasmid contains the AAV2 rep sequences for Rep78/68 under the control of the AAV2 P5 

promoter, and the AAV2 rep sequences for Rep52/40 under the control of the AAV2 P19 promoter. The AAV7 capsid 
sequences are under the control of the AAV2 P40 promoter, which is located within the Rep sequences. This plasmid 
15 further contains a spacer 5' of the rep ORF. 

C. Production of Pseudotyped rAAV 

[0188] The rAAV particles (AAV2 vector in AAV7 capsid) are generated using an adenovirus-free method. Briefly, 
20 the cis plasmid (pAAV2.1 lacZ plasmid containing AAV2 ITRs), and the trans plasmid pCRAAV7 6-5+15-4 (containing 
the AAV2 rep and AAV7 cap) and a helper plasmid, respectively, were simultaneously co-transfected into 293 cells in 
a ratio of 1 ;1 :2 by calcium phosphate precipitation. 

[0189] For the construction of the pAd helper plasmids, pBG1 0 plasmid was purchased from Microbix (Canada). A 
RsrI I fragment containing L2 and L3was deleted frompBHGIO, resulting in the first helper plasmid, pAdAF13. Plasmid 

25 AdA F1 was constructed by cloning Asp700/Sall fragment with a Pmel/Sgfl deletion, isolating from pBHGIO, into 
Bluescript. MLP, L2, L2 and L3 were deleted in the pAdAFI . Further deletions of a 2.3 kb Nrul fragment and, subse- 
quently, a 0.5 kb RsrI l/Nru I fragment generated helper plasmids pAdAF5 and pAdAF6, respectively. The helper plasmid, 
termed pAF6, provides the essential helper functions of E2a and E4 0RF6 not provided by the El -expressing helper 
cell, but is deleted of adenoviral capsid proteins and functional El regions). 

30 [0190] Typically, 50 ^g of DNA (cis:trans:helper) was transfected onto a 150 mm tissue culture dish. The 293 cells 
were harvested 72 hours post-transfection, sonicated and treated with 0.5% sodium deoxycholate (37°C for 10 min.) 
Cell lysates were then subjected to two rounds of a CsCI gradient. Peak fractions containing rAAV vector are collected, 
pooled and dialyzed against PBS. 

35 Example 4: Creation of infectious clones carrying intact novel AAV serotypes for study of basic virology in human and 
NHP derived cell lines and evaluation of pathogenesis of novel AAV serotypes in NHP and other animal models. 

[0191] To achieve this goal, the genome walker system is employed to obtain 5' and 3' terminal sequences (ITRs) 
and complete construction of clones containing intact novel AAV serotype genomes. 

40 [0192] Briefly, utilizing a commercially available Universal Genome Walker Kit [Clontech], genomic DNAs from mon- 
key tissues or cell lines that are identified as positive for the presence of AAV7 sequence are digested with Dra I, EcoR 
V, Pvu II and Stu I endonucleases and ligated to Genome Walker Adaptor to generate 4 individual Genome Walker 
Libraries (GWLs). Using DNAs from GWLs as templates, AAV7 and adjacent genomic sequences will be PCR-amplified 
by the adaptor primer 1 (API, provided in the kit) and an AAV7 specific primer 1 , followed by a nested PGR using the 

45 adaptor primer 2 (AP2) and another AAV7 specific primer 2, both of which are internal to the first set of primers. The 
major PCR products from the nested PGR are cloned and characterized by sequencing analysis. 
[0193] In this experiment, the primers covering the 257 bp or other signature fragment of a generic AAV genome are 
used for PCR amplification of cellular DNAs extracted from Human and NHP derived cell lines to identify and charac- 
terize latent AAV sequences. The identified latent AAV genomes are rescued from the positive cell lines using aden- 

50 ovirus helpers of different species and strains. 

[0194] To isolate infectious AAV clones from NHP derived cell lines, a desired cell line is obtained from ATCC and 
screened by PCR to identify the 257 bp amplicon, i.e., signature region of the invention. The 257 bp PCR product is 
cloned and serotyped by sequencing analysis. For these cell lines containing the AAV7 sequence, the cells are infected 
with SV-15, a simian adenovirus purchased from ATCC, human Ad5 or transfected with plasmid construct housing the 

55 human Ad genes that are responsible for AAV helper functions. At 48 hour post infection or transfection, the cells are 
harvested and Hirt DNA is prepared for cloning of AAV7 genome following Xiao et al., 1 999, J. Virol, 73:3994-4003. 
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Example 5 - Production of AAV Vectors 

[0195] A pseudotyping strategy similar to that of Example 3 for AAV 1/7 was employed to produce AAV2 vectors 
packaged with AAV1 , AAV5 and AAV8 capsid proteins. Briefly, recombinant AAV genomes equipped with AAV2 ITRs 
were packaged by triple transfection of 293 cells with cis-plasmid, adenovirus helper plasmid and a chimeric packaging 
construct where the AAV2 rep gene is fused with cap genes of novel AAV serotypes. To create the chimeric packaging 
constructs, the Xho I site of p5E1 8 plasmid at 31 69 bp was ablated and the modified plasmid was restricted with Xba 
I and Xho I in a complete digestion to remove the AAV2 cap gene and replace it with a 2267 bp Spe l/Xho I fragment 
containing the AAV8 cap gene [Xiao, W., et al., (1 999) J Virol 73, 3994-4003]. A similar cloning strategy was used for 
creation of chimeric packaging plasmids of AAV2/1 and AAV2/5. All recombinant vectors were purified by the standard 
CsClg sedimentation method except for AAV2/2, which was purified by single step heparin chromatography. 
[0196] Genome copy (GC) titers of AAV vectors were determined by TaqlVlan analysis using probes and primers 
targeting SV40 poly A region as described previously [Gao, G,, et al., (2000) Hum Gene Ther^^, 2079-91]. 
[0197] Vectors were constructed for each serotype for a number of in vitro and in vivo studies. Eight different trans- 
gene cassettes were incorporated into the vectors and recombinant virions were produced for each serotype. The 
recovery of virus, based on genome copies, is summarized in Table 4 below. The yields of vector were high for each 
serotype with no consistent differences between serotypes. Data presented in the table are average genome copy 
yields with standard deviation x 1 0^^ of multiple production lots of 50 plate (150 mm ) transfections. 



Table 4. 



Production of Recombinant Vectors 




AAV2/1 


AAV2/2 


AAV2/5 


AAV2/7 


AAV2/8 


CIVIV 


7.30+4.33 


4.49+2.89 


5.19 + 5.19 


3.42 


0.87 


LacZ 


(n=9) 


(n=6) 


(n=8) 


(n=1) 


(n=1) 


CIVIV 


6.43+2.42 


3.39 +2.42 


5.55 +6.49 


2.98+2.66 


3.74 +3.88 


EGFP 


(n=2) 


(n=2) 


(n=4) 


(n=2) 


(n=2) 


TBG LacZ 


4.18 


0.23 


0.704+0.43 


2.16 


0.532 




(n=1) 


(n=1) 


(n=2) 


(n=1) 


(n=1) 


Alb MAT 


4.67+0.75 


4.77 


4.09 


5.04 


2.02 




(n=2) 


(n=1) 


(n=1) 


(n=1) 


(n=1) 


CB A1AT 


0.567 


0.438 


2.82 


2.78 


0.816 + 




(n=1) 


(n=1) 


(n=1) 


(n=1) 


0.679 (n=2) 


TBG 


8.51+6.65 


3.47+2.09 


5.26+3.85 


6.52+3.08 


1.83±0.98 


rhCG 


(n=6) 


(n=5) 


(n=4) 


(n=4) 


(n=5) 


TBG cFIX 


1.24+1.29 


0.63+0.394 


3.74+2.48 


4.05 


15.8+15.0 




(n=3) 


(n=6) 


(n=7) 


(n=1) 


(n=5) 



Example 6 - Serologic Analysis of Pseudotyped Vectors 

[0198] G57BI_/6 mice were injected with vectors of different serotypes of AAVGBA1 AT vectors intramuscularly (5 x 

lO"""" GO) and serum samples were collected 34 days later. To test neutralizing and cross-neutralizing activity of sera 
to each serotype of AAV, sera was analyzed in a transduction based neutralizing antibody assay [Gao, G. P., et al., 
(1996) J Virol 70, 8934-43]. More specifically, the presence of neutralizing antibodies was determined by assessing 
the ability of serum to inhibit transduction of 84-31 cells by reporter viruses (AAVGMVEGFP) of different serotypes. 
Specifically, the reporter virus AAVCIVIVEGFP of each serotype [at multiplicity of infection (MO I) that led to a transduc- 
tion of 90% of indicator cells] was pre-incubated with heat- in activated serum from animals that received different se- 
rotypes of AAV or from naive mice. After 1 -hour incubation at 37° C, viruses were added to 84-31 cells in 96 well plates 
for 48 or 72- hour, depending on the virus serotype. Expression of GFP was measured by Fluorolmagin (Molecular 
Dynamics) and quantified by Image Quant Software. Neutralizing antibody titers were reported as the highest serum 
dilution that inhibited transduction to less than 50%. 

[0199] The availability of GFP expressing vectors simplified the development of an assay for neutralizing antibodies 
that was based on inhibition of transduction in a permissive cell line (i.e., 293 cells stably expressing E4 from Ad5). 
Sera to selected AAV serotypes were generated by intramuscular injection of the recombinant viruses. Neutralization 
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of AAV transduction by 1 :20 and 1 :80 dilutions of the antisera was evaluated (See Table 5 below). Antisera to AAV1 , 
AAV2, AAV5 and AAV8 neutralized transduction of the serotype to which the antiserunn was generated (AAV5 and 
AAV8 to a lesser extent than AAV1 and AAV2) but not to the other serotype (i.e., there was no evidence of cross 
neutralization suggesting that AAV 8 is a truly unique serotype). 

5 



Table 5. Serological Analysis of New AAV Serotypes. 



10 



15 





% Infection on 84-31 cells with AAVCMVEGFP virus: 


AAV2/1 


AAV2/2 


AAV2/5 


AAV2/7 


AAV2/8 


Serum dilution: 


Serum dilution: 


Serum dilution: 


Serum dilution: 


Serum dilution: 


Sera; 


Innmunization Vector 


1/20 


1/80 


1/20 


1/80 


1/20 


1/80 


1/20 


1/80 


1/20 


1/80 


Group 1 


AAV2/1 


0 


0 


100 


100 


100 


100 


100 


100 


100 


100 


Group 2 


AAV2/2 


100 


100 


0 


0 


100 


100 


100 


100 


100 


100 


Group 3 


AAV2/5 


100 


100 


100 


100 


16.5 


16.5 


100 


100 


100 


100 


Group 4 


AAV2/7 


100 


100 


100 


100 


100 


100 


61.5 


100 


100 


100 


Group 5 


AAV2/8 


100 


100 


100 


100 


100 


100 


100 


100 


26.3 


60 



20 [0200] Human sera from 52 normal subjects were screened for neutralization against selected serotypes. No serum 
sample was found to neutralize AAV2/7 and AAV2/8 while AAV2/2 and AAV2/1 vectors were neutralized in 20% and 
10% of sera, respectively. A fraction of human pooled IgG representing a collection of 60,000 individual samples did 
not neutralize AAV2/7 and AAV2/8, whereas AAV2/2 and AAV2/1 vectors were neutralized at titers of serum equal to 
1/1280 and 1/640, respectively. 

25 

Example 7 - In vivo Evaluation of Different Serotypes of AAV Vectors 

[0201] In this study, 7 recombinant AAV genomes, AAV2CBhA1AT, AAV2A1 bhAI AT, AAV2CMVrhCG, 
AAV2TBGrhCG, AAV2TBGcFIX, AAV2CMVLacZ and AAV2TBGLacZ were packaged with capsid proteins of different 

30 serotypes. In all 7 constructs, minigene cassettes were flanked with AAV2 ITRs. cDNAs of human a-antitrypsin (A1 AT) 
[Xiao, W., et al., (1999) J Virol 73, 3994-4003] |3-subunit of rhesus monkey choriogonadotropic hormone (CG) [Zoltick, 
P. W. & Wilson, J. M. (2000) Mol Ther2, 657-9] canine factor IX [Wang, L., et a!., (1997) Proc Natl Acad Scl USA 94, 
1 1 563-6] and bacterial p-glactosidase (i.e.. Lac Z) genes were used as reporter genes. For liver-directed gene transfer, 
either mouse albumin gene promoter (Alb) [Xiao, W. (1999), cited above] or human thyroid hormone binding globulin 

35 gene promoter (TBG) [Wang (1997), cited above] was used to drive liver specific expression of reporter genes. In 
muscle-directed gene transfer experiments, either cytomegalovirus early promoter (CMV) or chicken p-actin promoter 
with CMV enhancer (CB) was employed to direct expression of reporters. 

[0202] For muscle-directed gene transfer, vectors were injected into the right tibialis anterior of 4-6 week old NCR 
nude or C57BI_/6 mice (Taconic, Germantown, NY). In liver-directed gene transfer studies, vectors were infused in- 

40 traportally into 7-9 week old NCR nude or C57BI_/6 mice (Taconic, Germantown, NY). Serum samples were collected 
intraorbital ly at different time points after vector administration. Muscle and liver tissues were harvested at different 
time points for cryosectioning and Xgal histochemical staining from animals that received the lacZ vectors. For the re- 
administration experiment, C56BI_/6 mice initially received AAV2/1 , 2/2, 2/5, 2/7 and 2/8CBA1 AT vectors intramuscu- 
larly and followed for Al AT gene expression for 7 weeks. Animals were then treated with AAV2/8TBGcFIX intraportally 

45 and studied for cFIX gene expression. 

[0203] ELISA based assays were performed to quantify serum levels of hAI AT, rhCG and cFIX proteins as described 
previously [Gao, G. P., etal, (1 996) J \//ro/70, 8934-43; Zoltick, P. W. & Wilson, J. M, (2000) MolTher2, 657-9; Wang, 
L., et al., Proc Natl Acad Sci USA 94, 11563-6]. The experiments were completed when animals were sacrificed for 
harvest of muscle and liver tissues for DNA extraction and quantitative analysis of genome copies of vectors present 

50 in target tissues by TaqMan using the same set of primers and probe as in titration of vector preparations [Zhang, Y, 
et al., (2001) MolTherS, 697-707]. 

[0204] The performance of vectors base on the new serotypes were evaluated in murine models of muscle and liver- 
directed gene transfer and compared to vectors based on the known serotypes AAV1 , AAV2 and AAV5. Vectors ex- 
pressing secreted proteins (alpha-antitrypsin (A1 AT) and chorionic gonadotropin (CG)) were used to quantitate relative 
55 transduction efficiencies between different serotypes through ELISA analysis of sera. The cellular distribution of trans- 
duction within the target organ was evaluated using lacZ expressing vectors and X-gal histochemistry. 
[0205] The performance of AAV vectors in skeletal muscle was analyzed following direct injection into the tibialis 
anterior muscles. Vectors contained the same AAV2 based genome with the immediate early gene of CMV or a CMV 
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enhanced |3-actin pronnoter driving expression of the transgene. Previous studies indicated that innmune competent 
C57BI_/6 mice elicit limited humoral responses to the human A1 AT protein when expressed from AAV vectors [Xiao, 
W., et al, (1999) J Virol 73, 3994-4003]. 

[0206] In each strain, AAV2/1 vector produced the highest levels of A1 AT and AAV2/2 vector the lowest, with AAV2/7 
5 and AAV2/8 vectors showing intermediate levels of expression. Peak levels of CG at 28 days following injection of nu/ 
nu NCR mice showed the highest levels from AAV2/7 and the lowest from AAV2/2 with AAV2/8 and AAV2/1 in between. 
Injection of AAV2/1 and AAV2/7 lacZ vectors yielded gene expression at the injection sites in all muscle fibers with 
substantially fewer lacZ positive fibers observed with AAV2/2 and AAV 2/8 vectors. These data indicate that the effi- 
ciency of transduction with AAV2/7 vectors in skeletal muscle is similar to that obtained with AAV2/1 , which is the most 
10 efficient in skeletal muscle of the previously described serotypes [Xiao, W. (1999), cited above; Chao, H., et al., (2001) 
Mol Ther4, 217-22; Chao, H., et al, (2000) Mol Ther2, 619-23]. 

[0207] Similar murine models were used to evaluate liver-directed gene transfer. Identical doses of vector based on 

genome copies were infused into the portal veins of mice that were analyzed subsequently for expression of the trans- 
gene. Each vector contained an AAV2 based genome using previously described liver-specific promoters (i.e., albumin 
15 or thyroid hormone binding globulin) to drive expression of the transgene. More particularly, CMVCG and TBGCG 
minigene cassettes were used for muscle and liver-directed gene transfer, respectively. Levels of rhCG were defined 
as relative units (RUsx 1 0^). The data were from assaying serum samples collected at day 28, post vector administration 
(4 animals per group). As shown in Table 3, the impact of capsid proteins on the efficiency of transduction of Al AT 
vectors in nu/nu and C57BI_/6 mice and CG vectors in C57BL/6 mice was consistent (See Table 6). 

20 

Table 6. 





Expression of p-unitof Rhesus Monkey Chorionic Gonadotropin (rhCG) 


Vector 


Muscle 


Liver 


25 


AAV2/1 


4.5 + 2.1 


1.6+1.0 




AAV2 


0.5 ± 0.1 


0.7 + 0.3 




AAV2/5 


ND* 


4.8 + 0.8 




AAV2/7 


14.2 + 2.4 


8.2 + 4.3 


30 


AAV2/8 


4.0 ± 0.7 


76.0 + 22.8 



* Not determined in this experiment. 



[0208] In all cases, AAV2/8 vectors yielded the highest levels of transgene expression that ranged from 16 to 110 
greater than what was obtained with AAV2/2 vectors; expression from AAV2/5 and AAV2/7 vectors was intermediate 
35 with AAV2/7 higher than AAV2/5. Analysis of X-Gal stained liver sections of animals that received the corresponding 
lacZ vectors showed a correlation between the number of transduced cells and overall levels of transgene expression. 
DNAs extracted from livers of C57BI_/6 mice who received the Al AT vectors were analyzed for abundance of vector 
DNA using real time PGR technology. 

[0209] The amount of vector DNA found in liver 56 days after injection correlated with the levels of transgene ex- 
40 pression (See Table 7). For this experiment, a set of probe and primers targeting the SV40 polyA region of the vector 
genome was used for TaqMan PCR. Values shown are means of three individual animals with standard deviations. 
The animals were sacrificed at day 56 to harvest liver tissues for DNA extraction. These studies indicate that AAV8 is 
the most efficient vector for liver-directed gene transfer due to increased numbers of transduced hepatocytes. 



Table 7 - 



Real Time PCR Analysis for Abundance of AAV Vectors In nu/nu Mouse Liver Following Injection of IxlC'' 


Genome Copies of Vector. 


AAV vectors/Dose 


Genome Copies per Ceil 


AAV2/1AlbA1AT 


0.6 + 0.36 


AAV2AlbA1AT 


0.003 + 0.001 


AAV2/5AlbA1AT 


0.83 ± 0.64 


AAV2/7AlbA1AT 


2.2 ± 1.7 


AAV2/8AlbA1AT 


18± 11 



[0210] The serologic data described above suggest that AAV2/8 vector should not be neutralized in vivo following 
immunization with the other serotypes. C57BLy6 mice received intraportal injections of AAV2/8 vector expressing canine 
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factor IX (10"'"' genome copies) 56 days after they received intramuscular injections of A1 AT vectors of different sero- 
types, High levels of factor IX expression were obtained 14 days following infusion of AAV2/8into naive animals (17±2 
|xg/ml, n=4) which were not significantly different that what was observed in animals immunized with AAV2/1 (31+23 
[ig/ml, n=4), AAV2/2 (16 jig/ml, n=2), and AAV2/7 (12 |xg/ml, n=2). This contrasts to what was observed in AAV2/8 
5 immunized animals that were infused with the AAV2/8 factor IX vector in which no detectable factor IX was observed 
(< 0.1 ^g/mL n=4). 

[0211] Oligonucleotides to conserved regions of the cap gene did amplify sequences from rhesus monkeys that 
represented unique AAVs. Identical cap signature sequences were found in multiple tissues from rhesus monkeys 
derived from at least two different colonies. Full-length rep and cap open reading frames were isolated and sequenced 

10 from single sources. Only the cap open reading frames of the novel AAVs were necessary to evaluate their potential 
as vectors because vectors with the AAV7 or AAVS capsids were generated using the ITRs and rep from AAV2. This 
also simplified the comparison of different vectors since the actual vector genome is identical between different vector 
serotypes. In fact, the yields of recombinant vectors generated using this approach did not differ between serotypes. 
[0212] Vectors based on AAV7 and AAVS appear to be immunologically distinct (i.e., they are not neutralized by 

15 antibodies generated against other serotypes). Furthermore, sera from humans do not neutralize transduction by AAV7 
and AAVS vectors, which is a substantial advantage over the human derived AAVs currently under development for 
which a significant proportion of the human population has pre-existing immunity that is neutralizing [Chirmule, N., et 
al., (1999) Gene Ther6, 1574-83]. 

[0213] The tropism of each new vector is favorable for in vivo applications. AAV2/7 vectors appear to transduce 
20 skeletal muscle as efficiently as AAV2/1 , which is the serotype that confers the highest level of transduction in skeletal 
muscle of the primate AAVs tested to date [Xiao, W., cited above; Chou (2001), cited above, and Chou (2000), cited 
above]. Importantly, AAV2/8 provides a substantial advantage over the other serotypes in terms of efficiency of gene 
transfer to liver that until now has been relatively disappointing in terms of the numbers of hepatocytes stably trans- 
duced. AAV2/8 consistently achieved a 10 to 100-fold improvement in gene transfer efficiency as compared to the 
25 other vectors. The basis for the improved efficiency of AAV2/8 is unclear, although it presumably is due to uptake via 
a different receptor that is more active on the basolateral surface of hepatocytes. This improved efficiency will be quite 
useful in the development of liver-directed gene transfer where the number of transduced cells is critical, such as in 
urea cycle disorders and familial hypercholesterolemia. 

[0214] Thus, the present invention provides a novel approach for isolating new AAVs based on PGR retrieval of 
30 genomic sequences. The amplified sequences were easily incorporated into vectors and tested in animals. The lack 
of pre-existing immunity to AAV7 and the favorable tropism of the vectors for muscle indicates that AAV7 is suitable 
for use as a vector in human gene therapy and other in vivo applications. Similarly, the lack of pre-existing immunity 
to the AAV serotypes of the invention, and their tropisms, renders them useful in delivery of therapeutic molecules and 
other useful molecules. 

35 

Example 9 - Tissue Tropism Studies 

[0215] In the design of a high throughput functional screening scheme for novel AAV constructs, a non-tissue specific 
and highly active promoter, CB promoter (CMV enhanced chicken (3 actin promoter) was selected to drive an easily 

40 detectable and quantifiable reporter gene, human a anti-trypsin gene. Thus only one vector for each new AAV clone 
needs to be made for gene transfer studies targeting 3 different tissues, liver, lung and muscle to screen for tissue 
tropism of a particular AAV construct. The following table summarizes data generated from 4 novel AAV vectors in the 
tissue tropism studies (AAVCBA1 AT), from which a novel AAV capsid clone, 44.2, was found to be a very potent gene 
transfer vehicle in all 3 tissues with a big lead in the lung tissue particularly. Table 8 reports data obtained (in \Lg A1 AT/ 

45 mL serum) at day 14 of the study. 



Table 8 



50 



Vector 


Target Tissue 


Lung 


Liver 


Muscle 


AAV2/1 


ND 


ND 


45±11 


AAV2/5 


0.6+0.2 


ND 


ND 


AAV2/8 


ND 


84+30 


ND 


AAV2/rh.2 (43.1) 


14±7 


25+7.4 


35±14 


AAV2/rh.1 0(44.2) 


23±6 


53+19 


46±11 
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Tables (continued) 



Vector 


Target Tissue 


Lung 


Liver 


Muscle 


AAV2/rh.13(42.2) 


3.5+2 


2+0.8 


3.5+1.7 


AAV2/rh.21 (42.10) 


3.1±2 


2±1.4 


4.3±2 



A couple of other experinnents were then performed to confirnn the superior tropisnn of AAV 44.2 in lung tissue. First, 
AAV vector carried CC1 OhAI AT nninigene for lung specific expression were pseudotyped with capsids of novel AAVs 
were given to Innmune deficient aninnals (NCR nude) in equal volume (50 |a,l each of the original preps without dilution) 
via intratracheal injections as provided in the following table. In Table 9, 50 jii of each original prep per mouse, NCR 
Nude, detection limit >0.033 ^g/ml, Day 28 



Table 9 



25 



35 



Vector 


Total GO in 
50 ^1 vector 


jag of A1 AT/ml 
with 50^1 
vector 


jxg of A1 AT/ml 
with 1x1011 
vector 


Relative Gene transfer as compared 
to rh.10 (clone 44.2) 


2/1 


3x10^2 


2.6+0.5 


0.09+0,02 


2,2 


2/2 


5.5x1011 


<0.03 


<0.005 


<0.1 


2/5 


3.6x10^2 


0.65±0.16 


0.02±0.004 


0.5 


2/7 


4.2x10^2 


1±0.53 


0.02+0,01 


0.5 


2/8 


7.5x1011 


0.9+0.7 


0.12+0.09 


2.9 


2/ch.5 (A.3.1) 


9x1012 


1+0.7 


0.01 + 0.008 


0.24 


2/rh.8 (43.25) 


4.6x1012 


26+21 


0.56+0.46 


13.7 


2/rh.10 (44.2) 


2.8x1012 


115+38 


4.1±1.4 


100 


2/rh.13 (42.2) 


6x1012 


7.3+0.8 


0.12+0.01 


2.9 


2/rh,21 (42.10) 


2.4x10^2 


9+0.9 


0.38+0,04 


9,3 


2/rh.22 (42.11) 


2.6x1012 


6+0.4 


0.23+0.02 


5.6 


2/rh.24 (42.13) 


1.1x1011 


0.4+0.3 


0.4+0.3 


1 



The vectors were also administered to immune competent animals (C57BI_/6) in equal genome copies (1x1 on GC) 
as shown in the Table 1 0. (1 x1 OH GC per animal, C57BLy6, day 1 4, detection limit >0.033 ^ig/ml) 

40 

Table 10 



AAV Vector 


of A1 AT/m 1 with 1 x1 0"" i vecto r 


Relative Gene transfer as compared to rh.10 (clone 
44.2) 


2/1 


0.076+0.031 


2.6 


2/2 


0.1+0.09 


3.4 


2/5 


0,0840.033 


2.9 


2/7 


0.33+0.01 


11 


2/8 


1 .92+1 .3 


2.9 


2/ch.5 (A.3.1) 


0.048+0.004 


1.6 


2/rh.8 (43.25) 


1.7+0.7 


58 


2/rh.10 (44.2) 


2.93+1.7 


100 


2/rh.13 (42.2) 


0.45+0.15 


15 


2/rh.21 (42.10) 


0.86+0.32 


29 
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Table 1 0 (continued) 



AAV Vector 


of A1 AT/m 1 with 1 x1 C ^ vecto r 


Relative Gene transfer as compared to rh.10 (clone 
44.2) 


2/rh.22 (42.11) 


0.38+0.18 


13 


2/rh.24 (42.13) 


0.3±0.19 


10 



[0216] The data from both experiments confirmed the superb tropism of clone 44.2 in lung-directed gene transfer. 
[0217] Interestingly, performance of clone 44.2 in liver and muscle directed gene transfer was also outstanding, close 
to that of the best liver transducer, AAV8 and the best muscle transducer AAV1 , suggesting that this novel AAV has 
some intriguing biological significance. 

[0218] To study serological properties of those novel AAVs, pseudotyped AAVGFP vectors were created for immu- 
nization of rabbits and in vitro transduction of 84-31 cells in the presence and absence of antisera against different 
capsids. The data are summarized below: 



Table 11a. 



Cross-NAB assay in 8431 cells and adenovirus (Adv) colnfection 
Infection in 8431 cells (colnfected with Adv) with: 


Serum from rabbit immunized with: 


iqQ GC 


109 GC 


10^ GC 


10^0 GC 


rh.13 


rh.21 


rh.22 


rh.24 




AAV2/42.2 


AAV2/42.10 


AAV2/42.11 


AAV2/42.13 


AAV2/1 


1/20 


1/20 


1/20 


No NAB 


AAV2/2 


1/640 


1/1280 


1/5120 


No NAB 


AAV2/5 


No NAB 


1/40 


1/160 


No NAB 


AAV2/7 


1/81920 


1/81920 


1/40960 


1/640 


AAV2/8 


1/640 


1/640 


1/320 


1/5120 


C/7.5AAV2/A3 


1/20 


1/160 


1/640 


1/640 


rh.8 
AAV2/43.25 


1/20 


1/20 


1/20 


1/320 


rh.10 

AAV2/44.2 


No NAB 


No NAB 


No NAB 


1/5120 


rh.13 
AAV2/42.2 


1/5120 


1/5120 


1/5120 


No NAB 


rh.21 

AAV2/42.10 


1/5120 


1/10240 


1/5120 


1/20 


rh.22 
AAV2/42.11 


1/20480 


1/20480 


1/40960 


No NAB 


rh.24 

AAV2/42.13 


No NAB 


1/20 


1/20 


1/5120 
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Cross-NAB assay in 8431 cells and Adv colnfectlon 
Infection In 8431 cells (colnfected with Adv) with: 


Serum from rabbit immunized with: 


10^ GC 


10^° GC 


10^"GC 


10^ GC 


10® GC 


rh. 12 


ch.5 


rh.8 


rh.10 


rh.20 




AAV2/42.1 B 


AAV / A ^ 

AA\/2/A3 


AAV 2/43, 2 5 


A A V ir^ i A A r\ 


AAV2/42.8.2 


AAV2/1 


No NAB 


1/20480 


No NAB 


1/80 


ND 


AAV2/2 


1/20 


No NAB 


No NAB 


No NAB 


ND 


AAV2/5 


No NAB 


1/320 


No NAB 


No NAB 


ND 


AAV2/7 


1/2560 


1/640 


1/160 


1/81920 


ND 


AAV2/8 


1/1 0240 


1 /2560 


1/2560 


1/81 920 


ND 


cn.5 AAV2/A3 


1/1280 


1/1 0240 


ND 


1/5120 


1/320 


r/i.aAAV2/43.25 


1/1280 


ND 


1/20400 


1/5120 


1/2560 


rh.10 AAV2/44.2 


1/5120 


ND 


ND 


1/5120 


1/5120 


rA7.-/5 AAV2/42.2 


1/20 


ND 


ND 


No NAB 


1/320 


rAi.2y AAV2/42 ,10 


1/20 


ND 


ND 


1/40 


1/80 


r/i.22AAV2/42.11 


No NAB 


ND 


ND 


ND 


No NAB 


r/i.24AAV2/42.13 


1/5120 


ND 


ND 


ND 


1/2560 



Table 12 



Titer of rabbit sera 


Titer after 


Vector Titer d21 


Boosting 


ch.5 


AAV2/A3 


1/10,240 


1/40,960 


rh.8 


AAV2/43.25 


1/20,400 


1/163,840 


rh.10 


AAV2/44.2 


1/10,240 


1/527,680 


rh.13 


AAV2/42.2 


1/5,120 


1/20,960 


rh.21 


AAV2/42.10 


1/20,400 


1/81,920 


rh.22 


AAV2/42.11 


1/40,960 


ND 


rh.24 


AAV2/42.13 


1/5,120 


ND 



Table 13 a. 



Infection in 8431 cells (co infected with Adv) with GFP 




109 GC/well 


109 GC/well 


109 GC/well 


109 GC/well 


109 GC/well 


109 GCAwell 
ch.5 




AAV2/1 


AAV2/2 


AAV2/5 


AAV2/7 


AAV2/8 


AAV2/A3 


#GFU/field 


128 


>200 


95 


56 


13 


1 


83 


>200 


65 


54 


11 


1 
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Table 13b. 



Infection in 8431 cells (coinfected with Adv) with GFP 




10^ GC/well 


1 09 GC/ 
well 


1 09 GC/ 
well 


109 GC/well 


1 09 GC/well 


10® GC/well 


10^ GC/well 


rh.8 


rh.10 


rh.13 


rh.21 


rh.22 


rh.24 


rh.12 




AAV2/43.25 


AAV2/44.2 


AAV2/42.2 


AAV2/42.10 


AAV 2/42. 11 


AAV2/42.13 


AAV2/42,1B 


# 

GFU/ 
field 


3 


13 


54 


62 


10 


3 


18 


2 


12 


71 


60 


14 


2 


20 






48 


47 


16 


3 


12 



Example 10 - Mouse Model of Familial Hypercholesterolemia 



[0219] The following experiment demonstrates that the AAV2/7 construct of the invention delivers the LDL receptor 
and express LDL receptor in an amount sufficientto reduce the levels of plasma cholesterol and triglycerides in animal 
models of familial hypercholesterolemia. 

20 

A. Vector Construction 

[0220] AAV vectors packaged with AAV7 or AAV8 capsid proteins were constructed using a pseudotyping strategy 
[Hildinger M, et al., J. Virol 200-\ ; 75:6199-6203]. Recombinant AAV genomes with AAV2 inverted terminal repeats 

25 (ITR) were packaged by triple transfection of 293 cells with the c/s-plasmid, the adenovirus helper plasmid and a 
chimeric packaging construct, a fusion of the capsids of the novel AAV serotypes with the rep gene of AAV2. The 
chimeric packaging plasmid was constructed as previously described [Hildinger et al, cited above]. The recombinant 
vectors were purified by the standard CSCI2 sedimentation method. To determine the yield TaqMan (Applied Biosys- 
tems) analysis was performed using probes and primers targeting the SV40 poly(A) region of the vectors [Gao GP, et 

30 al., Hum Gene Tlier. 2000 Oct 10;11(15):2079-91]. The resulting vectors express the transgene under the control of 
the human thyroid hormone binding globulin gene promoter (TBG). 

B. Animals 

35 [0221] LDL receptor deficient mice on the C57B1/6 background were purchased from the Jackson Laboratory (Bar 
Harbor, ME, USA) and maintained as a breeding colony. Mice were given unrestricted access to water and obtained 
a high fat Western Diet (high % cholesterol) starting three weeks prior vector injection. At day -7 as well at day 0, blood 
was obtained via retroorbital bleeds and the lipid profile evaluated. The mice were randomly divided into seven groups. 
The vector was injected via an intraportal injection as previously described ([Chen SJ et al., Mol Therapy 2000; 2(3), 
256-261]. Briefly, the mice were anaesthetized with ketamine and xylazine. A laparotomy was performed and the portal 
vein exposed. Using a 30g needle the appropriate dose of vector diluted in lOOul PBS was directly injected into the 
portal vein. Pressure was applied to the injection site to ensure a stop of the bleeding. The skin wound was closed and 
draped and the mice carefully monitored for the following day. Weekly bleeds were performed starting at day 14 after 
liver directed gene transfer to measure blood lipids. Two animals of each group were sacrificed at the time points week 
6 and week 12 after vector injection to examine atherosclerotic plaque size as well as receptor expression. The re- 
maining mice were sacrificed at week 20 for plaque measurement and determination of transgene expression. 



Table 14 





Vector 


dose 


n 


Group 1 


AAV2/7-TBG-hLDLr 


1X 10"'2 gc 


12 


Group 2 


AAV2/7-TBG-hLDLr 


3x 1 0^^ gc 


12 


Group 3 


AAV2/7-TBG-hLDLr 


1x 101"" gc 


12 


Group 4 


AAV2/8-TBG-hLDLr 


1x 1012 gc 


12 


Group 5 


AAV2/8-TBG-hLDLr 


3x IOII gc 


12 
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Table 14 (continued) 





Vector 


dose 


n 


Group 6 


AAV2/8-TBG-hLDLr 


1x lO""! gc 


12 


Group 7 


AAV2/7-TBG-LacZ 


1x lO""! gc 


16 



C. Serum lipoprotein and liver function analysis 

to [0222] Blood samples were obtained from the retroorbital plexus after a 6 hour fasting period. The serum was sep- 
arated from the plasma by centrifugation. The amount of plasma lipoproteins and liver transaminases in the serum 
were detected using an automatized clinical chemistry analyzer (ACE, Schiapparelli Biosystems, Alpha Wassermann) 

D. Detection oftransgene expression 

15 

[0223] LDL receptor expression was evaluated by immu no-fluorescence staining and Western blotting. For Western 
Blot frozen liver tissue was homogenized with lysis buffer ( 20 mM Tris, pH7.4, 130mM NaCI, 1% Triton X ICQ, protei- 
nase inhibitor (complete, EDTA-free, Roche, Mannheim, Germany). Protein concentration was determined using the 
Micro BCA Protein Assay Reagent Kit (Pierce, Rockford, IL). 40 jag of protein was resolved on 4- 15% Tris-HCI Ready 
20 Gels (Biorad, Hercules, CA) and transferred to a nitrocellulose membrane (Invitrogen, ). To generate Anti-hLDL receptor 
antibodies a rabbit was injected intravenously with an AdhLDLr prep (1x1 0"'^ GO). Four weeks later the rabbit serum 
was obtained and used for Western Blot. A 1 :1 00 dilution of the serum was used as a primary antibody followed by a 
HRP-conjugated anti-rabbit IgG and ECL chemiluminescent detection (ECL Western Blot Detection Kit, Amersham, 
Arlington Heights, IL). 

25 

E. Immunocytochemistry 

[0224] For determination of LDL receptor expression in frozen liver sections immunohistochemistry analyses were 
performed. 10um cryostat sections were either fixed in acetone for 5 minutes, or unfixed. Blocking was obtained via a 

30 1 hour incubation period with 1 0% of goat serum. Sections were then incubated for one hour with the primary antibody 
at room temperature. A rabbit polyclonal antibody anti-human LDL (Biomedical Technologies Inc., Stoughton, MA) was 
used diluted accordingly to the instructions of the manufacturer. The sections were washed with PBS, and incubated 
with 1:100 diluted fluorescein goat anti-rabbit IgG (Sigma, St Louis, MO). Specimens were finally examined under 
fluorescence microscope Nikon Microphot-FXA. In all cases, each incubation was followed by extensive washing with 

35 PBS. Negative controls consisted of preincubation with PBS, omission of the primary antibody, and substitution of the 
primary antibody by an isotype-matched non-immune control antibody. The three types of controls mentioned above 
were performed for each experiment on the same day. 

F. Gene transfer efficiency 

40 

[0225] Liver tissue was obtained after sacrificing the mice at the designated time points. The tissue was shock frozen 
in liquid nitrogen and stored at -80°G until further processing. DNAwas extracted from the liver tissue using a QIAamp 
DNA Mini Kit (QIAGEN GmbH, Germany) according to the manufacturers protocol. Genome copies of AAV vectors in 
the liver tissue were evaluated using Taqman analysis using probes and primers against the SV40 poly(A) tail as 

45 described above. 

G. Atherosclerotic plaque measurement 

[0226] For the quantification of the atherosclerotic plaques in the mouse aorta the mice were anaesthetized (10% 
50 ketamine and xylazine, ip), the chest opened and the arterial system perfused with ice-cold phosphate buffered saline 
through the left ventricle. The aorta was then carefully harvested, slit down along the ventral midline from the aortic 
arch down to the femoral arteries and fixed in formalin. The lipid-rich atherosclerotic plaques were stained with Sudan 
IV (Sigma, Germany) and the aorta pinned out flat on a black wax surface. The image was captured with a Sony DXG- 
960 MD color video camera. The area of the plaque as well as of the complete aortic surface was determined using 
55 Phase 3 Imaging Systems (Media Cybernetics). 
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H. Clearance of LDL 

[0227] Two animals per experimental group were tested. A bolus of M^s . labeled LDL (generously provided by Dan 
Rader, U Penn) was infused slowly through the tail vein over a period of 30 sec (1 ,000,000 counts of [M^sj-LDL diluted 

5 in 1 00[i\ sterile PBS/ animal). At time points 3min, 30 min, 1 .5hr, 3hr, 6hr after injection a blood sample was obtained 
Wa the retro-orbital plexus. The plasma was separated off from the whole blood and 1 0^1 plasma counted in the gamma 
counter. Finally the fractional catabolic rate was calculated from the lipoprotein clearance data. 

I. Evaluation of Liver Lipid accumulation 

10 

[0228] Oil Red Staining of frozen liver sections was performed to determine lipid accumulation. The frozen liver 
sections were briefly rinsed in distilled water followed by a 2 minute incubation in absolute propylene glycol. The sections 
were then stained in oil red solution (0.5% in propylene glycol) for 16 hours followed by counterstaining with Mayer's 
hematoxylin solution for 30 seconds and mounting in warmed glycerin jelly solution. 

15 [0229] For quantification of the liver cholesterol and triglyceride content liver sections were homogenized and incu- 
bated in chloroform/methanol (2:1) overnight. After adding of 0.05% H2S04and centrifugation for 1 0 minutes, the lower 
layer of each sample was collected, divided in two aliquots and dried under nitrogen. For the cholesterol measurement 
the dried lipids of the first aliquot were dissolved in 1% Triton X-100 in chloroform. Once dissolved, the solution was 
dried under nitrogen. After dissolving the lipids in ddH20 and incubation for 30 minutes at 37°G the total cholesterol 

20 concentration was measured using a Total Cholesterol Kit (Wako Diagnostics). For the second aliquot the dried lipids 
were dissolved in alcoholic KOH and incubated at 60°G for 30 minutes. Then 1M MgCI2 was added, followed by 
incubation on ice for 10 minutes and centrifugation at 14,000 rpmfor30 minutes. The supernatant was finally evaluated 
for triglycerides (Wako Diagnostics). 

[0230] All of the vectors pseudotyped in an AAV2/8 or AAV2/7 capsid lowered total cholesterol, LDL and triglycerides 
25 as compared to the control. These test vectors also corrected phenotype of hypercholesterolemia in a dose-dependent 

manner. A reduction in plaque area for the AAV2/8 and AAV2/7 mice was observed in treated mice at the first test (2 
months), and the effect was observed to persist over the length of the experiment (6 months). 

Example 10 - Functional Factor IX Expression and Correction of Hemophilia 

30 

A. Knock-Out Mice 

[0231] Functional canine factor IX (FIX) expression was assessed in hemophilia B mice. Vectors with capsids of 
AAV1 , AAV2, AAV5, AAV7 or AAV8 were constructed to deliver AAV2 5' ITR - liver-specific promoter [LSP] - canine 
35 FIX - woodchuck hepatitis post-regulatory element (WPRE) - AA\/2 3' ITR . The vectors were constructed as described 
in Wang et al, 2000, Molecular Therapy 2: 154-158), using the appropriate capsids. 

[0232] Knock-out mice were generated as described in Wang et al, 1 997. Proc. NatL Acad. Scl. USA 94: 1 1 563-1 1 566. 
This model closely mimic the phenotypes of hemophilia B in human. 

[0233] Vectors of different serotypes (AAV1 , AAV2, AAV5, AAV7 and AAV8) were delivered as a single intraportal 
40 injection into the liver of adult hemophiliac C57B1/6 mice in a dose of 1 x1 0"'"' GC/mouse forthe five different serotypes 
and one group received an AAV8 vector at a lower dose, IxlO'io GG/mouse. Control group was injected with IxlO"""" 
GC of AAV2/8 TBG LacZ3. Each group contains 5-10 male and female mice. Mice were bled bi-weekly after vector 
administration. 

45 1 . ELISA 

[0234] The canine FIX concentration in the mouse plasma was determined by an ELISA assay specific for canine 
factor IX, performed essentially as described by Axelrod et al, 1990, Proc.Natl.Acad.Scl. USA, 87:5173-6177 mXh 
modifications. Sheep anti-canine factor IX (Enzyme Research Laboratories) was used as primary antibody and rabbit 
50 anti-canine factor IX ((Enzyme Research Laboratories) was used as secondary antibody. Beginning at two weeks 
following injection, increased plasma levels of cFIX were detected for all test vectors. The increased levels were sus- 
tained attherapeutic levels throughoutthe length of the experiment, i.e., to 12 weeks. Therapeutic levels are considered 
to be 5% of normal levels, i.e., at about 250 ng/mL. 

[0235] The highest levels of expression were observed forthe AAV2/8 (at 1 0"""") and AAV2/7 constructs, with sustained 
55 superphysiology levels cFIX levels (ten-fold higher than the nomnal level). Expression levels for AAV2/8 (lO"""") were 
approximately 10 fold higher than those observed f or AAV2/2 and AAV2/8 (1 0'lO). The lowest expression levels, although 
still above the therapeutic range, were observed for AAV2/5. 
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2. In Vitro Activated Partial Thromboplastin time (aPTT) Assay 

[0236] Functional factor IX activity in plasnna of the FIX knock-out mice was determined by an in vitro activated partial 
thromboplastin time (aPTT) assay-Mouse blood samples were collected from the retro-orbital plexus into 1/1 0 volume 
5 of citrate buffer. The aPTT assay was performed as described by Wang et al, 1997, Proc. Natl. Acad. Scl. USA 94: 
11563-11566. 

[0237] Clotting times by aPTT on plasma samples of all vector injected mice were within the normal range (approx- 
imately 60 sec) when measured at two weeks post-injection, and sustained clotting times in the nomnal or shorter than 
normal range throughout the study period (12 weeks). 

10 [0238] Lowest sustained clotting times were observed in the animals receiving AA\/2/8 (10"'"') and AAV2/7. By week 
12, AAV2/2 also induced clotting times similar to those for AAV2/8 and AAV2/7. However, this lowered clotting time 
was not observed for AAV2/2 until week 12, whereas lowered clotting times (in the 25 - 40 sec range) were observed 
for AAV2/8 and AAV2/7 beginning at week two. 

[0239] Immuno-histochemistry staining on the liver tissues harvested from some of the treated mice is currently being 
15 performed. About 70-80% of hepatocytes are stained positive for canine FIX in the mouse injected with AAV2/8.cFIX 
vector. 

B. Hemophilia B Dogs 

20 [0240] Dogs that have a point mutation in the catalytic domain of the F.IX gene, which, based on modeling studies, 
appears to render the protein unstable, suffer from hemophilia B [Evans et al, 1989, Proc. Natl. Acad. Scl. USA, 86: 
10095-10099). A colony of such dogs has been maintained for more than two decades at the University of North 

Carolina, Chapel Hill. The homeostatic parameters of these dogs are well described and include the absence of plasma 
F.IX antigen, whole blood clotting times in excess of 60 minutes, whereas normal dogs are 6-8 minutes, and prolonged 
25 activated partial thromboplastin time of 50-80 seconds, whereas normal dogs are 13-28 seconds. These dogs expe- 
rience recurrent spontaneous hemorrhages. Typically, significant bleeding episodes are successfully managed by the 
single intravenous infusion of 10 ml/kg of normal canine plasma; occasionally, repeat infusions are required to control 
bleeding. 

[0241] Four dogs are injected intraportally with AAV.cFIX according to the schedule below. A first dog receives a 

30 single injection with AAV2/2.cFIX at a dose of 3.7x1 0"'"' genome copies (GC)/kg. A second dog receives a first injection 
of AAV2/2.CFIX (2.8x101"' QC/kg), followed by a second injection with AAV2/7.cFIX (2,3x1 0^^ GC/kg) at day 1180. A 
third dog receives a single injection with AAV2/2.cFIXat adoseof 4.6x10''2 GC/kg. The fourth dog receives an injection 
with AAV2/2.CFIX (2.8x1 0^2 GC/kg) and an injection at day 995 with AAV2/7.cFIX (5x10^2 GC/kg). 
[0242] The abdomen of hemophilia dogs are aseptically and surgically opened under general anesthesia and a single 

35 infusion of vector is administered into the portal vein. The animals are protected from hemorrhage in the peri-operative 
period by intravenous administration of normal canine plasma. The dog is sedated, intubated to induce general an- 
esthesia, and the abdomen shaved and propped. After the abdomen is opened, the spleen is moved into the operative 
field. The splenic vein is located and a suture is loosely placed proximal to a small distal incision in the vein. A needle 
is rapidly inserted into the vein, then the suture loosened and a 5 F cannula is threaded to an intravenous location near 

40 the portal vein threaded to an intravenous location near the portal vein bifurcation. After hemostasis is secured and 
the catheter balloon inflated, approximately 5.0 ml of vector diluted in PBS is infused into the portal vein over a 5 minute 
interval. The vector infusion is followed by a 5.0 ml infusion of saline. The balloon is then deflated, the callula removed 
and venous hemostasis is secured. The spleen is then replaced, bleeding vessels are cauterized and the operative 
wound is closed. The animal is extubated having tolerated the surgical procedure well. Blood samples are analyzed 

45 as described. [Wang et al, 2000, Molecular Therapy 2: 154-158] 

[0243] Results showing correction or partial correction are anticipated for AAV2/7. 

[0244] All publications cited in this specification are incorporated herein by reference. While the invention has been 
described with reference to a particularly preferred embodiments, it will be appreciated that modifications can be made 
without departing from the spirit of the invention. Such modifications are intended to fall within the scope of the claims. 

50 
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<210> 1 
<211> 4721 
<212> DNA 

<213> adeno-associated virus serotype 7 

30 <400> 1 

ttggccactc cctctatgcg cgctcgctcg ctcggtgggg cctgcggacc aaaggtccgc 60 

agacggcaga gctctgctct gccggcccca ccgagcgagc gagcgcgcat agagggagtg 120 

35 gccaactcca tcactagggg taccgcgaag cgcctcccac gctgccgcgt cagcgctgac 180 

gtaaatcacg tcatagggga gtggtcctgt attagctgtc acgtgagtgc ttttgcgaca 240 

ttttgcgaca ccacgtggcc atttgaggta tatatggccg agtgagcgag caggatctcc 300 

40 attttgaccg cgaaatttga acgagcagca gccatgccgg gtttctacga gatcgtgatc 360 

aaggtgccga gcgacctgga cgagcacctg ccgggcattt ctgactcgtt tgtgaactgg 420 

gtggccgaga aggaatggga gctgcccccg gattctgaca tggatctgaa tctgatcgag 480 

45 caggcacccc tgaccgtggc cgagaagctg cagcgcgact tcctggtcca atggcgccgc 540 

gtgagtaagg ccccggaggc cctgttcttt gttcagttcg agaagggcga gagctacttc 600 

caccttcacg ttctggtgga gaccacgggg gtcaagtcca tggtgctagg ccgcttcctg 660 

50 agtcagattc gggagaagct ggtccagacc atctaccgcg gggtcgagcc cacgctgccc 720 

aactggttcg cggtgaccaa gacgcgtaat ggcgccggcg gggggaacaa ggtggtggac 780 

gagtgctaca tccccaacta cctcctgccc aagacccagc ccgagctgca gtgggcgtgg 840 

actaacatgg aggagtatat aagcgcgtgt ttgaacctgg ccgaacgcaa acggctcgtg 900 
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55 



gcgcagcacc 


tgacccacgt 


cagccagacg 


caggagcaga 


acaaggagaa 


tctgaacccc 


960 


aattctgacg 


cgcccgtgat 


caggtcaaaa 


acctccgcgc 


gctacatgga 


gctggtcggg 


1020 


tggctggtgg 


accggggcat 


cacctccgag 


aagcagtgga 


tccaggagga 


ccaggcctcg 


1080 


tacatctcct 


tcaacgccgc 


ctccaactcg 


cggtcccaga 


tcaaggccgc 


gctggacaat 


1140 


gccggcaaga 


tcatggcgct 


gaccaaatcc 


gcgcccgact 


acctggtggg 


gccctcgctg 


1200 


cccgcggaca 


ttaaaaccaa 


ccgcatctac 


cgcatcctgg 


agctgaacgg 


gtacgatcct 


1260 


gcctacgccg 


gctccgtctt 


tctcggctgg 


gcccagaaaa 


agttcgggaa 


gcgcaacacc 


1320 


atctggctgt 


ttgggcccgc 


caccaccggc 


aagaccaaca 


ttgcggaagc 


catcgcccac 


1380 


gccgtgccct 


tctacggctg 


cgtcaactgg 


accaatgaga 


actttccctt 


caacgattgc 


1440 


gtcgacaaga 


tggtgatctg 


gtgggaggag 


ggcaagatga 


cggccaaggt 


cgtggagtcc 


1500 


gccaaggcca 


ttct cggcgg 


cagcaaggtg 


cgcgtggacc 


aaaagtgcaa 


gtcgtccgcc 


1560 


cagatcgacc 


ccacccccgt 


gatcgtcacc 


tccaacacca 


acatgtgcgc 


cgtgattgac 


1620 


gggaacagca 


ccaccttcga 


gcaccagcag 


ccgttgcagg 


accggatgtt 


caaatttgaa 


1680 


ctcacccgcc 


gtctggagca 


cgactttggc 


aaggtgacga 


agcaggaagt 


caaagagttc 


1740 


ttccgctggg 


ccagtgatca 


cgtgaccgag 


gtggcgcatg 


agttctacgt 


cagaaagggc 


1800 


ggagccagca 


aaagacccgc 


ccccgatgac 


gcggatataa 


gcgagcccaa 


gcgggcctgc 


1860 


ccctcagtcg 


cggatccatc 


gacgtcagac 


gcggaaggag 


ctccggtgga 


ctttgccgac 


1920 


aggtaccaaa 


acaaatgttc 


tcgtcacgcg 


ggcatgattc 


agatgctgtt 


tccctgcaaa 


1980 


acgtgcgaga 


gaatgaatca 


gaatttcaac 


atttgcttca 


cacacggggt 


cagagactgt 


2040 


ttagagtgtt 


tccccggcgt 


gtcagaatct 


caaccggtcg 


tcagaaaaaa 


gacgtatcgg 


2100 


aaactctgcg 


cgattcatca 


tctgctgggg 


cgggcgcccg 


agattgcttg 


ctcggcctgc 


2160 


gacctggtca 


acgtggacct 


ggacgactgc 


gtttctgagc 


aataaatgac 


ttaaaccagg 


2220 


tatggctgcc 


gatggttatc 


ttccagattg 


gctcgaggac 


aacctctctg 


agggcattcg 


2280 


cgagtggtgg 


gacctgaaac 


ctggagcccc 


gaaacccaaa 


gccaaccagc 


aaaagcagga 


2340 


caacggccgg 


ggtctggtgc 


ttcctggcta 


caagtacctc 


ggacccttca 


acggactcga 


2400 


caagggggag 


cccgtcaacg 


cggcggacgc 


agcggccctc 


gagcacgaca 


aggcctacga 


2460 


ccagcagctc 


aaagcgggtg 


acaatccgta 


cctgcggtat 


aaccacgccg 


acgccgagtt 


2520 


tcaggagcgt 


ctgcaagaag 


atacgtcatt 


tgggggcaac 


ctcgggcgag 


cagtcttcca 


2580 


ggccaagaag 


cgggttctcg 


aacctctcgg 


tctggttgag 


gaaggcgcta 


agacggctcc 


2640 


tgcaaagaag 


agaccggtag 


agccgtcacc 


tcagcgttcc 


cccgactcct 


ccacgggcat 


2700 


cggcaagaaa 


ggccagcagc 


ccgccagaaa 


gagactcaat 


ttcggtcaga 


ctggcgactc 


2760 


agagtcagtc 


cccgaccctc 


aacctctcgg 


agaacctcca 


gcagcgccct 


ctagtgtggg 


2820 
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atctggtaca 


gtggctgcag 


gcggtggcgc 


accaatggca 


gacaataacg 


aaggtgccga 


2880 


cggagtgggt 


aatgcctcag 


gaaattggca 


ttgcgattcc 


acatggctgg 


gcgacagagt 


2940 


cattaccacc 


agcacccgaa 


cctgggccct 


gcccacctac 


aacaaccacc 


tctacaagca 


3000 


aatctccagt 


gaaactgcag 


gtagtaccaa 


cgacaacacc 


tacttcggct 


acagcacccc 


3060 


ctgggggtat 


tttgacttta 


acagattcca 


ctgccacttc 


tcaccacgtg 


actggcagcg 


3120 


actcatcaac 


aacaactggg 


gattccggcc 


caagaagctg 


cggttcaagc 


tcttcaacat 


3180 


ccaggtcaag 


gaggtcacga 


cgaatgacgg 


cgttacgacc 


atcgctaata 


accttaccag 


3240 


cacgattcag 


gtattctcgg 


actcggaata 


ccagctgccg 


tacgtcctcg 


gctctgcgca 


3300 


ccagggctgc 


ctgcctccgt 


tcccggcgga 


cgtcttcatg 


attcctcagt 


acggctacct 


3360 


gactctcaac 


aatggcagtc 


agtctgtggg 


acgttcctcc 


ttctactgcc 


tggagtactt 


3420 


cccctctcag 


atgctgagaa 


cgggcaacaa 


ctttgagttc 


agctacagct 


tcgaggacgt 


3480 


gcctttccac 


agcagctacg 


cacacagcca 


gagcctggac 


cggctgatga 


atcccctcat 


3540 


cgaccagtac 


ttgtactacc 


tggccagaac 


acagagtaac 


ccaggaggca 


cagctggcaa 


3600 


tcgggaactg 


cagttttacc 


agggcgggcc 


ttcaac tatg 


gccgaacaag 


ccaagaat tg 


3660 


gttacctgga 


ccttgcttcc 


ggcaacaaag 


agtctccaaa 


acgctggat c 


aaaacaacaa 


3720 


cagcaacttt 


gcttggactg 


gtgccaccaa 


atatcacctg 


aacggcagaa 


actcgttggt 


3780 


taatcccggc 


gtcgccatgg 


caactcacaa 


ggacgacgag 


gaccgctttt 


tcccatccag 


3840 


cggagtcctg 


atttttggaa 


aaactggagc 


aactaacaaa 


actacattgg 


aaaatgtgtt 


3900 


aatgacaaat 


gaagaagaaa 


ttcgtcctac 


taatcctgta 


gccacggaag 


aatacgggat 


3960 


agtcagcagc 


aacttacaag 


cggctaatac 


tgcagcccag 


acacaagttg 


tcaacaacca 


4020 


gggagcctta 


cctggcatgg 


tctggcagaa 


ccgggacgtg 


tacctgcagg 


gtcccatctg 


4080 


ggccaagatt 


cctcacacgg 


atggcaactt 


tcacccgtct 


cctttgatgg 


gcggctttgg 


4140 


acttaaacat 


ccgcctcctc 


agatcctgat 


caagaacact 


cccgttcccg 


ctaatcctcc 


4200 


ggaggtgttt 


actcctgcca 


agtttgcttc 


gttcatcaca 


cagtacagca 


ccggacaagt 


4260 


cagcgtggaa 


atcgagtggg 


agctgcagaa 


ggaaaacagc 


aagcgctgga 


acccggagat 


4320 


tcagtacacc 


tccaactttg 


aaaagcagac 


tggtgtggac 


tttgccgttg 


acagccaggg 


4380 


tgtttactct 


gagcctcgcc 


ctattggcac 


tcgttacctc 


acccgtaatc 


tgtaattgca 


4440 


tgttaatcaa 


taaaccggtt 


gattcgtttc 


agttgaactt 


tggtctcctg 


tgcttcttat 


4500 


cttatcggtt 


tccatagcaa 


ctggttacac 


attaactgct 


tgggtgcgct 


tcacgataag 


4560 


aacactgacg 


tcaccgcggt 


acccctagtg 


atggagttgg 


ccactccctc 


tatgcgcgct 


4620 


cgctcgctcg 


gtggggcctg 


cggaccaaag 


gtccgcagac 


ggcagagctc 


tgctctgccg 


4680 


gccccaccga 


gcgagcgagc 


gcgcatagag 


ggagtggcca 


a 




4721 
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<210> 2 
<211> 737 
<212> PRT 

<213> capsid protein of adeno-associated virus serotpye 7 
<400> 2 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asn Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Ala Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 



Gly Lys Lys Gly Gin Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro 
180 185 190 



Pro Ala Ala Pro Ser Ser Val Gly Ser Gly Thr Val Ala Ala Gly Gly 
195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn 
210 215 220 



Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 
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lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Lys Gin lie Ser Ser Glu Thr Ala Gly Ser Thr Asn Asp Asn 
260 265 270 



Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
275 280 285 



Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn 
290 295 300 



Asn Trp Gly Phe Arg Pro Lys Lys Leu Arg Phe Lys Leu Phe Asn lie 
305 310 315 320 



Gin Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn 
325 330 335 



Asn Leu Thr Ser Thr lie Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu 
340 345 350 



Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys leu Pro Pro Phe Pro 
355 360 365 



Ala Asp Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn 
370 375 380 



Gly Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385 390 395 400 



Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Ser 
405 410 415 



Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu 
420 425 430 



Asp Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala 
435 440 445 



Arg Thr Gin Ser Asn Pro Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin 
450 455 460 



Phe Tyr Gin Gly Gly Pro Ser Thr Met Ala Glu Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pro Gly Pro Cys Phe Arg Gin Gin Arg Val Ser Lys Thr Leu Asp 
485 490 495 



46 



EP1 310 571 A2 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 



Leu Asn Gly Arg Asn Ser leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Asp Arg Phe Phe Pro Ser Ser Gly Val Leu lie 
530 535 540 



Phe Gly Lys Thr Gly Ala Thr Asn Lys Thr Thr Leu Glu Asn Val Leu 
545 550 555 560 



Met Thr Asn Glu Glu Glu lie Arg Pro Thr Asn Pro Val Ala Thr Glu 
565 570 575 



Glu Tyr Gly lie Val Ser Ser Asn Leu Gin Ala Ala Asn Thr Ala Ala 
580 585 590 



Gin Thr Gin Val Val Asn Asn Gin Gly Ala Leu Pro Gly Met Val Trp 
595 600 605 



Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro Tie Trp Ala Lys lie Pro 
610 615 620 



His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly 
625 630 635 640 



Leu Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val Pro 
645 650 655 



Ala Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala Ser Phe lie 
660 665 670 



Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu Leu 
675 680 685 



Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr Ser 
690 695 700 



Asn Phe Glu Lys Gin Thr Gly Val Asp Phe Ala Val Asp Ser Gin Gly 
705 710 715 720 



Val Tyr Ser Glu Pro Arg Pro Tie Gly Thr Arg Tyr Leu Thr Arg Asn 
725 730 735 



Leu 



47 



EP1 310 571 A2 



<210> 3 
<211> 623 
<212> PRT 

<213> rep protein of adeno-associated virus serotype 7 
<400> 3 

Met Pro Gly Phe Tyr Glu lie Val lie Lys Val Pro Ser Asp Leu Asp 
15 10 15 



Glu His Leu Pro Gly lie Ser Asp Ser Phe Val Asn Trp Val Ala Glu 
20 25 30 



Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu lie 
35 40 45 



Glu Gin Ala Pro Leu Thr Val Ala Glu Lys Leu Gin Arg Asp Phe Leu 
50 55 60 



Val Gin Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 70 75 80 



Gin Phe Glu Lys Gly Glu Ser Tyr Phe His Leu His Val Leu Val Glu 
85 90 95 



Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gin lie 
100 105 110 



Arg Glu Lys Leu Val Gin Thr lie Tyr Arg Gly Val Glu Pro Thr Leu 
115 120 125 



Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 
130 135 140 



Asn Lys Val Val Asp Glu Cys Tyr lie Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 



Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Glu Glu Tyr lie 
165 170 175 



Ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu Val Ala Gin His 
180 185 190 



Leu Thr His Val Ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Leu Asn 
195 200 205 



Pro Asn Ser Asp Ala Pro Val lie Arg Ser Lys Thr Ser Ala Arg Tyr 
210 215 220 



Met Glu Leu Val Gly Trp Leu Val Asp Arg Gly lie Thr Ser Glu Lys 
225 230 235 240 
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Gin Trp He Gin Glu Asp Gin Ala Ser Tyr He Ser Phe Asn Ala Ala 
245 250 255 



Ser Asn Ser Arg Ser Gin He Lys Ala Ala Leu Asp Asn Ala Gly Lys 
260 265 270 



He Met Ala Leu Thr Lys Ser Ala Pro Asp Tyr Leu Val Gly Pro Ser 
275 280 285 



Leu Pro Ala Asp He Lys Thr Asn Arg He Tyr Arg He Leu Glu Leu 
290 295 300 



Asn Gly Tyr Asp Pro Ala Tyr Ala Gly Ser Val Phe Leu Gly Trp Ala 
305 310 315 320 



Gin Lys Lys Phe Gly Lys Arg Asn Thr He Trp Leu Phe Gly Pro Ala 
325 330 335 



Thr Thr Gly Lys Thr Asn He Ala Glu Ala He Ala His Ala Val Pro 
340 345 350 



Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 
355 360 355 



Cys val Asp Lys Met Val He Trp Trp Glu Glu Gly Lys Met Thr Ala 
370 375 380 



Lys Val Val Glu Ser Ala Lys Ala He Leu Gly Gly Ser Lys Val Arg 
385 390 395 400 



Val Asp Gin Lys Cys Lys Ser Ser Ala Gin He Asp Pro Thr Pro Val 
405 410 415 



He Val Thr Ser Asn Thr Asn Met Cys Ala Val He Asp Gly Asn Ser 
420 425 430 



Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 
435 440 445 



Glu Leu Thr Arg Arg Leu Glu His Asp Phe Gly Lys Val Thr Lys Gin 
450 455 460 



Glu Val Lys Glu Phe Phe Arg Trp Ala Ser Asp His Val Thr Glu Val 
465 470 475 480 



Ala His Glu Phe Tyr Val Arg Lys Gly Gly Ala Ser Lys Arg Pro Ala 
485 490 495 
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Pro Asp Asp Ala Asp He Ser Glu Pro Lys Arg Ala Cys Pro Ser Val 
500 505 510 

5 

Ala Asp Pro Ser Thr Ser Asp Ala Glu Gly Ala Pro Val Asp Phe Ala 
515 520 525 

Asp Arg Tyr Gin Asn Lys Cys Ser Arg His Ala Gly Met He Gin Met 
530 535 540 

Leu Phe Pro Cys Lys Thr Cys Glu Arg Met Asn Gin Asn Phe Asn He 
545 550 555 560 

15 

Cys Phe Thr His Gly Val Arg Asp Cys Leu Glu Cys Phe Pro Gly Val 
565 570 575 

20 Ser Glu Ser Gin Pro Val Val Arg Lys Lys Thr Tyr Arg Lys Leu Cys 

580 585 590 

Ala He His His Leu Leu Gly Arg Ala Pro Glu He Ala Cys Ser Ala 
595 600 605 

25 

Cys Asp Leu Val Asn Val Asp Leu Asp Asp Cys Val Ser Glu Gin 



<211> 4393 
<212> DNA 

<213> adeno-associated virus serotype 8 
<400> 4 

cagagaggga gtggccaact ccatcactag gggtagcgcg aagcgcctcc cacgctgccg 60 

cgtcagcgct gacgtaaatt acgtcatagg ggagtggtcc tgtattagct gtcacgtgag 120 

tgcttttgcg gcattttgcg acaccacgtg gccatttgag gtatatatgg ccgagtgagc 180 

gagcaggatc tccattttga ccgcgaaatt tgaacgagca gcagccatgc cgggcttcta 240 

cgagatcgtg atcaaggtgc cgagcgacct ggacgagcac ctgccgggca tttctgactc 300 

gtttgtgaac tgggtggccg agaaggaatg ggagctgccc ccggattctg acatggatcg 360 

gaatctgatc gagcaggcac ccctgaccgt ggccgagaag ctgcagcgcg acttcctggt 420 

ccaatggcgc cgcgtgagta aggccccgga ggccctcttc t':;tgttcagt tcgagaaggg 480 

cgagagctac tttcacctgc acgttctggt cgagaccacg ggggtcaagt ccatggtgct 540 

aggccgcttc ctgagtcaga ttcgggaaaa gcttggtcca gaccatctac ccgcggggtc 600 

gagccccacc ttgcccaact ggttcgcggt gaccaaagac gcggtaatgg cgccggcggg 660 

ggggaacaag gtggtggacg agtgctacat ccccaactac ctcctgccca agactcagcc 720 

cgagctgcag tgggcgtgga ctaacatgga ggagtatata agcgcgtgct tgaacctggc 780 
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cgagcgcaaa 


cggctcgtgg 


cgcagcacct 


gacccacgtc 


agccagacgc 


aggagcagaa 


840 


5 


caaggagaat 


ctgaacccca 


attctgacgc 


gcccgtgatc 


aggtcaaaaa 


cctccgcgcg 


900 




ctatatggag 


ctggtcgggt 


ggctggtgga 


ccggggcatc 


acctccgaga 


agcagtggat 


960 




ccaggaggac 


caggcctcgt 


acatctcctt 


caacgccgcc 


tccaactcgc 


ggtcccagat 


1020 


10 


caaggccgcg 


ctggacaatg 


ccggcaagat 


catggcgctg 


accaaatccg 


cgcccgacta 


1080 




cctggtgggg 


ccctcgctgc 


ccgcggacat 


tacccagaac 


cgcatctacc 


gcatcctcgc 


1140 




tctcaacggc 


tacgaccctg 


cctacgccgg 


ctccgtcttt 


ctcggctggg 


ctcagaaaaa 


1200 


15 


gttcgggaaa 


cgcaacacca 


tctggctgtt 


tggacccgcc 


accaccggca 


agaccaacat 


1260 




tgcggaagcc 


atcgcccacg 


ccgtgccctt 


ctacggctgc 


gtcaactgga 


ccaatgagaa 


1320 




ctttcccttc 


aatgattgcg 


tcgacaagat 


ggtgatctgg 


tgggaggagg 


gcaagatgac 


1380 


20 


ggccaaggtc 


gtggagtccg 


ccaaggccat 


tctcggcggc 


agcaaggtgc 


gcgtggacca 


1440 




aaagtgcaag 


tcgtccgccc 


agatcgaccc 


cacccccgtg 


atcgtcacct 


ccaacaccaa 


1500 




catgtgcgcc 


gtgattgacg 


ggaacagcac 


caccttcgag 


caccagcagc 


ctctccagga 


1560 


25 


ccggatgttt 


aagttcgaac 


tcacccgccg 


tctggagcac 


gactttggca 


aggtgacaaa 


1620 




gcaggaagtc 


aaagagttct 


tccgctgggc 


cagtgatcac 


gtgaccgagg 


tggcgcatga 


1680 




gttttacgtc 


agaaagggcg 


gagccagcaa 


aagacccgcc 


cccgatgacg 


cggataaaag 


1740 


30 


cgagcccaag 


cgggcctgcc 


cctcagtcgc 


ggatccatcg 


acgtcagacg 


cggaaggagc 


1800 




tccggtggac 


tttgccgaca 


ggtaccaaaa 


caaatgttct 


cgtcacgcgg 


gcatgcttca 


1860 




gatgctgttt 


ccctgcaaaa 


cgtgcgagag 


aatgaatcag 


aatttcaaca 


tttgcttcac 


1920 


35 


acacggggtc 


agagactgct 


cagagtgttt 


ccccggcgtg 


tcagaatctc 


aaccggtcgt 


1980 




cagaaagagg 


acgtatcgga 


aactctgtgc 


gattcatcat 


ctgctgcggc 


gggctcccga 


2040 




gattgcttgc 


tcggcctgcg 


atctggtcaa 


cgtggacctg 


gatgactgtg 


tttctgagca 


2100 


40 


ataaatgact 


taaaccaggt 


atggctgccg 


atggttatct 


tccagattgg 


ctcgaggaca 


2160 




acctctctga 


gggcattcgc 


gagtggtggg 


cgctgaaacc 


tggagccccg 


aagcccaaag 


2220 




ccaaccagca 


aaagcaggac 


gacggccggg 


gtctggtgct 


tcctggctac 


aagtacctcg 


2280 


45 


gacccttcaa 


cggactcgac 


aagggggagc 


ccgtcaacgc 


ggcggacgca 


gcggccctcg 


2340 




agcacgacaa 


ggcctacgac 


cagcagctgc 


aggcgggtga 


caatccgtac 


ctgcggtata 


2400 




accacgccga 


cgccgagttt 


caggagcgtc 


tgcaagaaga 


tacgt ctttt 


gggggcaacc 


2460 


50 


tcgggccagc 


agtcttccag 


gccaagaagc 


gggttctcga 


acctctcggt 


ctggtt gagg 


2520 




aaggcgc taa 


gacggctcct 


ggaaagaaga 


caccggtaga 


gccat caccc 


cagcgt tctc 


2580 




cagac tcctc 


tacgggcatc 


ggcaagaaag 


gccaacagcc 


cgccagaaaa 


agactcaatt 


2 640 


55 


ttggtcagac 


tggcgactca 


gagtcagttc 


cagaccctca 


acctctcgga 


gaacctccag 


2700 
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cagcgccctc 


tggtgtggga 


cctaatacaa 


tggctgcagg 


cggtggcgca 


ccaatggcag 


2760 


5 


acaataacga 


aggcgccgac 


ggagtgggta 


gttcctcggg 


aaattggcat 


tgcgattcca 


2820 




catggctggg 


cgacagagtc 


atcaccacca 


gcacccgaac 


ctgggccctg 


cccacctaca 


2880 




acaaccacct 


ctacaagcaa 


atctccaacg 


ggacatcggg 


aggagccacc 


aacgacaaca 


2940 


10 


cctacttcgg 


ctacagcacc 


ccctgggggt 


attttgactt 


taacagattc 


cactgccact 


3000 




tttcaccacg 


tgactggcag 


cgactcatca 


acaacaactg 


gggattccgg 


cccaagagac 


3060 




tcagcttcaa 


gctcttcaac 


atccaggtca 


aggaggtcac 


gcagaatgaa 


ggcaccaaga 


3120 


15 


ccatcgccaa 


taacctcacc 


agcaccatcc 


aggtgtttac 


ggactcggag 


taccagctgc 


3180 




cgtacgttct 


cggctctgcc 


caccagggct 


gcctgcctcc 


gttcccggcg 


gacgtgttca 


3240 




tgattcccca 


gtacggctac 


ctaacactca 


acaacggtag 


tcaggccgtg 


ggacgctcct 


3300 


20 


ccttctactg 


cctggaatac 


tttccttcgc 


agatgctgag 


aaccggcaac 


aacttccagt 


3360 




ttacttacac 


cttcgaggac 


gtgcctttcc 


acagcagcta 


cgcccacagc 


cagagcttgg 


3420 




accggctgat 


gaatcctctg 


attgaccagt 


acctgtacta 


cttgtctcgg 


actcaaacaa 


3480 


25 


caggaggcac 


ggcaaatacg 


cagactctgg 


gcttcagcca 


aggtgggcct 


aatacaatgg 


3540 




ccaatcaggc 


aaagaactgg 


ctgccaggac 


cctgttaccg 


ccaacaacgc 


gtctcaacga 


3600 




caaccgggca 


aaacaacaat 


agcaactttg 


cctggactgc 


tgggaccaaa 


taccatctga 


3660 


30 


atggaagaaa 


ttcattggct 


aatcctggca 


tcgctatggc 


aacacacaaa 


gacgacgagg 


3720 




agcgtttttt 


tcccagtaac 


gggatcctga 


tttttggcaa 


acaaaatgct 


gccagagaca 


3780 




atgcggatta 


cagcgatgtc 


atgctcacca 


gcgaggaaga 


aatcaaaacc 


actaaccctg 


3840 


35 


tggctacaga 


ggaatacggt 


atcgtggcag 


ataacttgca 


gcagcaaaac 


acggctcctc 


3900 




aaattggaac 


tgtcaacagc 


cagggggcct 


tacccggtat 


ggtctggcag 


aaccgggacg 


3960 




tgtacctgca 


gggtcccatc 


tgggccaaga 


ttcctcacac 


ggacggcaac 


ttccacccgt 


4020 


40 


ctccgctgat 


gggcggcttt 


ggcctgaaac 


atcctccgcc 


tcagatcctg 


atcaagaaca 


4080 




cgcctgtacc 


tgcggatcct 


ccgaccacct 


tcaaccagtc 


aaagctgaac 


tctttcatca 


4140 




cgcaatacag 


caccggacag 


gtcagcgtgg 


aaattgaatg 


ggagctgcag 


aaggaaaaca 


4200 


45 


gcaagcgctg 


gaaccccgag 


atccagtaca 


cctccaacta 


ctacaaatct 


acaagtgtgg 


4260 




actttgctgt 


taatacagaa 


ggcgtgtact 


ctgaaccccg 


ccccattggc 


acccgttacc 


4320 




tcacccgtaa 


tctgtaattg 


cctgttaatc 


aataaaccgg 


ttgattcgtt 


tcagttgaac 


4380 




tttggtctct 


gcg 
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<400> 5 



5 


cagagaggga 


gtggccaact 


ccatcactag 


gggtaatcgc 


gaagcgcctc 


ccacgc tgcc 


60 




gcgtcagcgc 


tgacgtagat 


t acgt catag 


gggagtggtc 


c tgtatt age 


tgtcacgtga 


12 0 




gtgcttttgc 


gacattttgc 


gacaccacat 


ggccattt ga 


ggtatatatg 


gccgagtgag 


18 0 


10 


cgagcaggat 


ctccattttg 


accgcgaaat 


ttgaacgagc 


agcagccatg 


ccgggcttct 


24 0 


acgagattgt 


gatcaaggtg 


ccgagcgacc 


tggacgagca 


cctgccgggc 


atttct gact 


300 




cttttgtgaa 


ctgggtggcc 


gagaaggaat 


cggagctgcc 


cccggattct 


gacatcgatc 


360 


15 


ggaatct gat 


cgagcaggca 


cccct gaccg 


tggccgagaa 


gctgcagcgc 


gacttcctgg 


42 0 


tccaatggcc 


ccgcgtgagt 


aaggccccgg 


aggccctctt 


ctttgttcag 


ttcgagaagg 


48 0 




gcgagagcta 


ctttcacctg 


cacgttctgg 


tcgagaccac 


gggggtcaag 


tccatggtgc 


54 0 


20 


taggccgctt 


cctgagtcag 


attcgggaga 


agctggtcca 


gaccatctac 


cgcgggatcg 


600 


agccgaccct 


gcccaactgg 


ttcgcggtga 


ccaagacgcg 


taatggcgcc 


ggcgggggga 


660 




acaaggtggt 


ggacgagtgc 


tacatcccca 


ac tacctcct 


gcccaagact 


cagcccgagc 


720 


25 


tgcagtgggc 


gtggactaac 


atggaggagt 


atataagcgc 


gtgcttgaac 


ctggccgagc 


780 


gcaaacggct 


cgtggcgcag 


cacctgaccc 


acgtcagcca 


gacgcaggag 


cagaacaagg 


840 




agaatct gaa 


ccccaattct 


gacgcgcccg 


tgatcaggtc 


aaaaacctcc 


gcgcgctaca 


900 




tggagctggt 


cgggtggctg 


gtggaccggg 


gcatcacctc 


cgagaagcag 


tggatccagg 


960 


30 


aggaccaggc 


ctcgtacatc 


tccttcaacg 


ccgcctccaa 


ctcgcggtcc 


cagatcaagg 


1020 




ccgcgct gga 


caatgccggc 


aagat catgg 


cgctgaccaa 


atccgcgccc 


gactacctgg 


1080 




taggccc tt c 


actt ccggtg 


gacattacgc 


agaaccgcat 


ctaccgcat c 


ctgcagc tea 


114 0 


35 


acggctacga 


ccct gcctac 


gccggctccg 


tc tttctcgg 


c tgggcacaa 


aagaagttcg 


1200 




ggaaacgcaa 


caccatct gg 


ctgtttgggc 


cggccaccac 


gggaaagacc 


aacatcgcag 


12 60 




aagccat t gc 


ccacgccg tg 


cccttctacg 


gctgcgtcaa 


ctggaccaat 


gagaactttc 


132 0 


40 


cc ttcaacga 


ttgcgtcgac 


aagatggtga 


tctggtggga 


ggagggcaag 


atgacggcca 


1380 




aggtcgtgga 


gtccgccaag 


gccattctcg 


gcggcagcaa 


ggtgcgcgtg 


gaccaaaagt 


1440 




gcaagtcgtc 


cgcccagatc 


gaccccactc 


ccgtgatcgt 


cacctccaac 


accaacatgt 


1500 


45 


gcgccgtgat 


tgacgggaac 


agcaccacct 


tcgagcacca 


gcagcctctc 


caggaccgga 


1560 




tgtttaagtt 


cgaactcacc 


cgccgtctgg 


agcacgactt 


tggcaaggtg 


acaaagcagg 


1620 




aagtcaaaga 


gttcttccgc 


tgggccagtg 


atcacgtgac 


cgaggtggcg 


catgagtttt 


1680 


50 


acgtcagaaa 


gggcggagcc 


agcaaaagac 


ccgcccccga 


tgacgcggat 


aaaagcgagc 


1740 




ccaagcgggc 


ctgcccctca 


gtcgcggatc 


catcgacgtc 


agacgcggaa 


ggagctccgg 


1800 




tggactttgc 


cgacaggtac 


caaaacaaat 


gttctcgtca 


cgcgggcatg 


cttcagatgc 


1860 


55 


tgcttccctg 


caaaacgtgc 


gagagaatga 


atcagaattt 


caacatttgc 


ttcacacacg 


1920 
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5 gggtcagaga ctgctcagag tgtttccccg 

agaggacgta tcggaaactc tgtgcgattc 
cttgctcggc ctgcgatctg gtcaacgtgg 
tgacttaaac caggtatggc tgccgatggt 
tctgagggca ttcgcgagtg gtgggcgctg 
cagcaaaagc aggacgacgg ccggggtctg 
ttcaacggac tcgacaaggg ggagcccgtc 

15 

ggcaaggcct acgaccagca gctgcaggcg 
gccgacgccg agtttcagga gcgtctgcaa 
cgagcagtct tccaggccaa gaagcgggtt 

20 

gctaagacgg ctcctggaaa gaagagaccg 
tcctctacgg gcatcggcaa gaaaggccaa 
cagactggcg actcagagtc agttccagac 

25 

ccctctggtg tgggacctaa tacaatggct 
aacgaaggcg ccgacggagt gggtaattcc 
ctgggggaca gagtcatcac caccagcacc 

30 

cacctctaca agcaaatctc caatggaaca 
tttggctaca gcaccccctg ggggtatttt 
ccacgtgact ggcagcgact catcaacaac 

35 

ttcaagctgt tcaacatcca ggtcaaggag 
gccaataacc ttaccagcac cgtccaggtc 
gtcctaggct ctgcccacca aggatgcctg 

40 

cctcagtacg gctacctgac gctcaacaat 
tactgtctgg aatacttccc ttctcagatg 
tacactttcg aggacgtgcc tttccacagc 

45 

ctgatgaacc ccctcatcga ccagtaccta 
actgggggaa ctcaaacttt ggcattcagc 
gctagaaact gggtacccgg gccttgctac 

50 

caaaataaca acagcaactt tgcgtggacg 
gactcgctaa tgaatcctgg cgtggctatg 
tttccatcaa gtggcgttct catat::tggc 
tacagccagg tgctgattac agatgaggaa 



gcgtgtcaga 


atctcaaccg 


gtcgtcagaa 


1980 


atcatctgct 


ggggcgggct 


cccgagattg 


2040 


acctggat ga 


ctgtgtttct 


gagcaataaa 


2100 


tatcttccag 


attggctcga 


ggacaacctc 


2160 


aaacctggag 


ccccgaagcc 


caaagccaac 


2220 


gtgcttcctg 


gctacaagta 


cctcggaccc 


2280 


aacgcggcgg 


acgcagcggc 


cctcgagcac 


2340 


ggtgacaatc 


cgtacctgcg 


gtataaccac 


2400 


gaagatacgt 


ctttt ggggg 


caacct cggg 


2460 


ctcgaacctc 


tcggtctggt 


tgaggaaggc 


2 52 0 


gtagagccat 


caccccagcg 


ttctccagac 


2580 


cagcccgcca 


gaaaaagact 


caattt tggt 


2640 


cctcaacctc 


tcggagaacc 


tccagcagcg 


2700 


gcaggcggtg 


gcgcaccaat 


ggcagacaat 


2760 


tcgggaaatt 


ggcattgcga 


ttccacatgg 


2820 


cgaacctggg 


cattgcccac 


ctacaacaac 


2880 


tcgggaggaa 


gcaccaacga 


caacacctac 


2940 


gacttcaaca 


gattccactg 


ccacttctca 


3000 


aactggggat 


tccggccaaa 


gagactcaac 


3060 


gttacgacga 


acgaaggcac 


caagaccatc 


312 0 


tttacggact 


cggagtacca 


gctaccgtac 


3180 


ccaccgtttc 


ctgcagacgt 


cttcatggtt 


324 0 


ggaagtcaag 


cgttaggacg 


ttcttctttc 


3300 


ctgagaaccg 


gcaacaactt 


tcagttcagc 


3360 


agctacgcac 


acagccagag 


tctagatcga 


3420 


tactacctgg 


tcagaacaca 


gacaactgga 


3480 


caagcaggcc 


ctagctcaat 


ggccaatcag 


3540 


cgtcagcagc 


gcgtctccac 


aaccaccaac 


3600 


ggagctgcta 


aattcaagct 


gaacgggaga 


3660 


gcatcgcaca 


aagacgacga 


ggaccgcttc 


3720 


aagcaaggag 


ccgggaacga 


tggagtcgac 


3780 


gaaattaaag 


ccaccaaccc 


tgtagccaca 


3840 
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5 



10 



15 





gagcagtggc 


catcaacaac 


caggccgcta 


acacgcaggc 


gcaaactgga 


3900 


cttgtgcatia 


accagggagt 


tattcctggt 


atggtctggc 


agaaccggga 


cgtg tacctg 


3960 


c ag^grgccct a 


tttgggctaa 


aatacc tcac 


acagatggca 


actttcaccc 


gtctcctctg 


4020 


?^ i'rfrtrfi'cfcifif' 

d 1.^ u U M ^ M ^ ^ ^ 


ttggactgaa 


acacccacct 


ccacagattc 


t aatt aaaaa 


tacaccagtg 


4080 


ccggcagatc 


ctcctcttac 


cttcaatcaa 


gccaagctga 


actctttcat 


cacgcagtac 


414 0 


agcacgggac 


aagtcagcgt 


ggaaatcgag 


tgggagctgc 


agaaagaaaa 


cagcaagcgc 


4200 


tggaatccag 


agatccagta 


tacttcaaac 


tactacaaat 


ctacaaatgt 


ggactttgct 


4260 


gtcaatacca 


aaggtgttta 


ctctgagcct 


cgccccattg 


gtactcgtta 


cctcacccgt 


4320 


aatttgtaat 


tgcctgttaa 


tcaataaacc 


ggttaattcg 


tttcagttga 


actttggtct 


4380 


ctgcg 
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<400> 6 





ttgcccactc 


cctctctgcg 


cgctcgctcg 


ctcggtgggg 


cctgcggacc 


aaaggtccgc 


60 




agacggcaga 


gctctgctct 


gccggcccca 


ccgagcgagc 


gagcgcgcag 


agagggagtg 


120 


30 


ggcaactcca 


tcactagggg 


taatcgcgaa 


gcgcctccca 


cgctgccgcg 


tcagcgctga 


180 




cgtaaattac 


gtcatagggg 


agtggtcctg 


tattagctgt 


cacgtgagtg 


cttttgcgac 


240 




attttgcgac 


accacgtggc 


catttagggt 


atatatggcc 


gagtgagcga 


gcaggatctc 


300 


35 


cattttgacc 


gcgaaatttg 


aacgagcagc 


agccatgccg 


ggcttctacg 


agatcgtgat 


360 




caaggtgccg 


agcgacctgg 


acgagcacct 


gccgggcatt 


tctgactcgt 


ttgtgagctg 


420 




ggtggccgag 


aaggaatggg 


agctgccccc 


ggattctgac 


atggatctga 


atctgattga 


480 


40 


gcaggcaccc 


ctgaccgtgg 


ccgagaagct: 


gcagcgcgac 


ttcctggtcc 


aatggcgccg 


540 




cgtgagtaag 


gccccggagg 


ccctcttctt 


tgttcagttc 


gagaagggcg 


agtcct actt 


600 




ccacctccat 


attctggtgg 


agaccacggg 


ggtcaaatcc 


atggtgctgg 


gccgcttcct 


660 


45 


gagtcagatt 


agggacaagc 


tggtgcagac 


catctaccgc 


gggatcgagc 


cgaccctgcc 


720 




caactggttc 


gcggtgacca 


agacgcgtaa 


tggcgccgga 


ggggggaaca 


aggtggtgga 


780 




cgagtgctac 


atccccaact 


acctcctgcc 


caagactcag 


cccgagctgc 


agtgggcgtg 


840 


50 


gactaacatg 


gaggagtata 


taagcgcctg 


tttgaacctg 


gccgagcgca 


aacggctcgt 


900 




ggcgcagcac 


ctgacccacg 


tcagccagac 


ccaggagcag 


aacaaggaga 


atctgaaccc 


960 




caattctgac 


gcgcctgtca 


tccggtcaaa 


aacctccgcg 


cgctacatgg 


agctggtcgg 


1020 


55 


gtggctggtg 


gaccggggca 


tcacctccga 


gaagcagtgg 


atccaggagg 


accaggcctc 


1080 
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gtacatctcc 


ttcaacgccg 


cttccaactc 


gcggtcccag 


atcaaggccg 


ctctggacaa 


1140 


5 


tgccggcaag 


atcatggcgc 


tgaccaaatc 


cgcgcccgac 


tacctggtag 


gccccgctcc 


1200 




gcccgcggac 


attaaaacca 


accgcatcta 


ccgcatcctg 


gagctgaacg 


gctacgaacc 


1260 




tgcctacgcc 


ggctccgtct 


ttctcggctg 


ggcccagaaa 


aggttcggga 


agcgcaacac 


1320 


10 


catctggctg 


tttgggccgg 


ccaccacggg 


caagaccaac 


atcgcggaag 


ccatcgccca 


1380 




cgccgtgccc 


ttctacggct 


gcgtcaactg 


gaccaatgag 


aactttccct 


tcaatgattg 


1440 




cgtcgacaag 


atggtgatct 


ggtgggagga 


gggcaagatg 


acggccaagg 


tcgtggagtc 


1500 


15 


cgccaaggcc 


attctcggcg 


gcagcaaggt 


gcgcgtggac 


caaaagtgca 


agtcgtccgc 


1560 




ccagatcgac 


cccacccccg 


tgatcgtcac 


ctccaacacc 


aacatgtgcg 


ccgtgattga 


1620 




cgggaacagc 


accaccttcg 


agcaccagca 


gccgttgcag 


gaccggatgt 


tcaaatttga 


1680 


20 


actcacccgc 


cgtctggagc 


atgactttgg 


caaggtgaca 


aagcaggaag 


tcaaagagtt 


1740 




cttccgctgg 


gcgcaggatc 


acgtgaccga 


ggtggcgcat 


gagttctacg 


tcagaaaggg 


1800 




tggagccaac 


aaaagacccg 


cccccgatga 


cgcggataaa 


agcgagccca 


agcgggcctg 


1860 


25 


cccctcagtc 


gcggatccat 


cgacgtcaga 


cgcggaagga 


gctccggtgg 


actttgccga 


1920 




caggtaccaa 


aacaaatgtt 


ctcgtcacgc 


gggcatgctt 


cagatgctgt 


ttccctgcaa 


1980 




gacatgcgag 


agaatgaatc 


agaatttcaa 


catttgcttc 


acgcacggga 


cgagagactg 


2040 


30 


ttcagagtgc 


ttccccggcg 


tgtcagaatc 


tcaaccggtc 


gtcagaaaga 


ggacgtatcg 


2100 




gaaactctgt 


gccattcatc 


atctgctggg 


gcgggctccc 


gagattgctt 


gctcggcctg 


2160 




cgatctggtc 


aacgtggacc 


tggatgactg 


tgtttctgag 


caataaatga 


cttaaaccag 


2220 


35 


gtatggctgc 


cgatggttat 


cttccagatt 


ggctcgagga 


caacctctct 


gagggcattc 


2280 




gcgagtggtg 


ggacttgaaa 


cctggagccc 


cgaagcccaa 


agccaaccag 


caaaagcagg 


2340 




acgacggccg 


gggtctggtg 


cttcctggct 


acaagtacct 


cggacccttc 


aacggactcg 


2400 


40 


acaaggggga 


gcccgtcaac 


gcggcggacg 


cagcggccct 


cgagcacgac 


aaggcctacg 


2460 




accagcagct 


caaagcgggt 


gacaatccgt 


acctgcggta 


taaccacgcc 


gacgccgagt 


2520 




ttcaggagcg 


tctgcaagaa 


gatacgtctt 


ttgggggcaa 


cctcgggcga 


gcagtcttcc 


2580 


45 


aggccaagaa 


gcgggttctc 


gaacctctcg 


gtctggttga 


ggaaggcgct 


aagacggctc 


2640 




ctggaaagaa 


acgtccggta 


gagcagtcgc 


cacaagagcc 


agactcctcc 


tcgggcatcg 


2700 




gcaagacagg 


ccagcagccc 


gctaaaaaga 


gactcaattt 


tggtcagact 


ggcgactcag 


2760 


50 


agtcagtccc 


cgatccacaa 


cctctcggag 


aacctccagc 


aacccccgct 


gctgtgggac 


2820 




ctactacaat 


ggcttcaggc 


ggtggcgcac 


caatggcaga 


caataacgaa 


ggcgccgacg 


2880 




gagtgggtaa 


tgcctcagga 


aattggcatt 


gcgattccac 


atggctgggc 


gacagagtca 


2940 


55 


tcaccaccag 


cacccgcacc 


tgggccttgc 


ccacctacaa 


taaccacctc 


"acaagcaaa 


3000 
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tctccagtgc 


ttcaacgggg 


gccagcaacg 


acaaccacta 


cttcggctac 


agcaccccct 


3060 


5 




tgatttcaac 


agattccact 


gccacttttc 


accacgtgac 


tggcagcgac 


3120 




tcatcaacaa 


caattgggga 


ttccggccca 


agagactcaa 


cttcaaactc 


ttcaacatcc 


3180 




aagtcaagga 


ggtcacgacg 


aatgatggcg 


tcacaaccat 


cgctaataac 


cttaccagca 


3240 


10 


cggttcaagt 


cttctcggac 


tcggagtacc 


agcttccgta 


cgtcctcggc 


tctgcgcacc 


3300 




agggctgcct 


ccctccgttc 


ccggcggacg 


tgttcatgat 


tccgcaatac 


ggctacctga 


3360 




cgctcaacaa 


tggcagccaa 


gccgtgggac 


gttcatcctt 


ttactgcctg 


gaatatttcc 


3420 


15 


cttctcagat 


gctgagaacg 


ggcaacaact 


ttaccttcag 


ctacaccttt 


gaggaagtgc 


3480 




ctttccacag 


cagctacgcg 


cacagccaga 


gcctggaccg 


gctgatgaat 


cctctcatcg 


3540 




accaatacct 


gtattacctg 


aacagaactc 


aaaatcagtc 


cggaagtgcc 


caaaacaagg 


3600 


20 


acttgctgtt 


tagccgtggg 


tctccagctg 


gcatgtctgt 


tcagcccaaa 


aactggctac 


3660 




ctggaccctg 


ttatcggcag 


cagcgcgttt 


ctaaaacaaa 


aacagacaac 


aacaacagca 


3720 




attttacctg 


gactggtgct 


tcaaaatata 


acctcaatgg 


gcgtgaatcc 


atcatcaacc 


3780 


25 


ctggcactgc 


tatggcctca 


cacaaagacg 


acgaagacaa 


gttctttccc 


atgagcggtg 


3840 




t catgatttt 


tggaaaagag 


agcgccggag 


cttcaaacac 


tgcattggac 


aatgtcatga 


3900 




ttacagacga 


agaggaaatt 


aaagccacta 


accctgtggc 


caccgaaaga 


tt tgggaccg 


3960 


30 


t ggcagtcaa 


tttccagagc 


agcagcacag 


accctgcgac 


cggagatgtg 


catgctatgg 


4020 




gagcattacc 


tggcatggtg 


tggcaagata 


gagacgtgta 


cctgcagggt 


cccatttggg 


4080 




ccaaaattcc 


tcacacagat 


ggacactttc 


acccgtctcc 


tcttatgggc 


ggctttggac 


4140 


35 


tcaagaaccc 


gcctcctcag 


atcctcatca 


aaaacacgcc 


tgttcctgcg 


aatcctccgg 


4200 




cggagttttc 


agctacaaag 


tttgcttcat 


tcatcaccca 


atactccaca 


ggacaagtga 


4260 




gtgtggaaat 


tgaatgggag 


ctgcagaaag 


aaaacagcaa 


gcgctggaat 


cccgaagtgc 


4320 


40 


agtacacatc 


caattatgca 


aaatctgcca 


acgttgattt 


tactgtggac 


aacaatggac 


4380 




tttatactga 


gcctcgcccc 


attggcaccc 


gttaccttac 


ccgtcccctg 


taattacgtg 


4440 




ttaatcaata 


aaccggttga 


ttcgtttcag 


ttgaactttg 


gtctcctgtc 


cttcttatct 


4500 


45 


tatcggttac 


catggttata 


gcttacacat 


taactgcttg 


gttgcgcttc 


gcgataaaag 


4560 




acttacgtca 


tcgggttacc 


cctagtgatg 


gagttgccca 


ctccctctct 


gcgcgctcgc 


4620 




tcgctcggtg 


gggcctgcgg 


accaaaggtc 


cgcagacggc 


agagctctgc 


tctgccggcc 


4680 


50 


ccaccgagcg 


agcgagcgcg 


cagagaggga 


gtgggcaa 






4718 
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<400> 7 
ttggccactc 


cctctctgcg 


cgctcgctcg 


ctcactgagg 


ccgggcgacc 


aaaggtcgcc 


60 


cgacgcccgg 


gctttgcccg 


ggcggcctca 


gtgagcgagc 


gagcgcgcag 


agagggagtg 


12 0 


gccaactcca 


tcactagggg 


ttcctggagg 


ggtggagtcg 


tgacgtgaat 


tacgtcatag 


180 


ggttagggag 


gtcctgtatt 


agaggtcacg 


tgagtgtttt 


gcgacatttt 


gcgacaccat 


240 


gtggtcacgc 


tgggtattta 


agcccgagtg 


agcacgcagg 


gtctccattt 


tgaagcggga 


300 


ggtttgaacg 


cgcagccgcc 


atgccggggt 


tttacgagat 


tgtgattaag 


gtccccagcg 


360 


accttgacgg 


gcatctgccc 


ggcatttctg 


acagctttgt 


gaactgggtg 


gccgagaagg 


420 


aatgggagtt 


gccgccagat 


tctgacatgg 


atctgaatct 


gattgagcag 


gcacccctga 


480 


c cgtggccga 


gaagctgcag 


cgcgactttc 


tgacggaatg 


gcgccgtgtg 


agtaaggccc 


540 


cggaggccct 


tttctttgtg 


caatttgaga 


agggagagag 


ctacttccac 


atgcacgtgc 


600 


tcgtggaaac 


caccggggtg 


aaatccatgg 


ttttgggacg 


tttcctgagt 


cagattcgcg 


660 


aaaaactgat 


tcagagaatt 


taccgcggga 


tcgagccgac 


tttgccaaac 


tggttcgcgg 


720 


tcacaaagac 


cagaaatggc 


gccggaggcg 


ggaacaaggt 


ggtggatgag 


tgctacatcc 


780 


ccaattactt 


gctccccaaa 


acccagcctg 


agctccagtg 


ggcgtggact 


aatatggaac 


840 


agtatttaag 


cgcctgtttg 


aatctcacgg 


agcgtaaacg 


gttggtggcg 


cagcatctga 


900 


cgcacgtgtc 


gcagacgcag 


gagcagaaca 


aagagaatca 


gaatcccaat 


tctgatgcgc 


960 


cggtgatcag 


atcaaaaact 


tcagccaggt 


acatggagct 


ggtcgggtgg 


ctcgtggaca 


1020 


aggggattac 


ctcggagaag 


cagtggatcc 


aggaggacca 


ggcctcatac 


atctccttca 


1080 


atgcggcctc 


caactcgcgg 


tcccaaatca 


aggctgcctt 


ggacaatgcg 


ggaaagatta 


1140 


tgagcctgac 


taaaaccgcc 


cccgactacc 


tggtgggcca 


gcagcccgtg 


gaggacattt 


1200 


ccagcaatcg 


gatttataaa 


attttggaac 


taaacgggta 


cgatccccaa 


tatgcggctt 


1260 


ccgtctttct 


gggatgggcc 


acgaaaaagt 


tcggcaagag 


gaacaccatc 


tggctgtttg 


1320 


ggcctgcaac 


taccgggaag 


accaacatcg 


cggaggccat 


agcccacact 


gtgcccttct 


1380 


acgggtgcgt 


aaactggacc 


aatgagaact 


ttcccttcaa 


cgactgtgtc 


gacaagatgg 


1440 


tgatctggtg 


ggaggagggg 


aagatgaccg 


ccaaggtcgt 


ggagtcggcc 


aaagccattc 


1500 


tcggaggaag 


caaggtgcgc 


gtggaccaga 


aatgcaagtc 


ctcggcccag 


atagacccga 


1560 


ctcccgtgat 


cgtcacctcc 


aacaccaaca 


tgtgcgccgt 


gattgacggg 


aactcaacga 


1620 


ccttcgaaca 


ccagcagccg 


ttgcaagacc 


ggatgttcaa 


atttgaactc 


acccgccgtc 


1680 


tggatcatga 


ctttgggaag 


gtcaccaagc 


aggaagtcaa 


agactttttc 


cggtgggcaa 


1740 


aggatcacgt 


ggttgaggtg 


gagcatgaat 


tctacgtcaa 


aaagggtgga 


gccaagaaaa 


1800 


gacccgcccc 


cagtgacgca 


gatataagtg 


agcccaaacg 


ggtgcgcgag 


tcagttgcgc 


1860 


agccatcgac 


gtcagacgcg 


gaagcttcga 


tcaactacgc 


agacaggtac 


caaaacaaat 


1920 
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gttctcgtca 


cgtgggcatg 


aatctgatgc 


tgtttccctg 


cagacaatgc 


gagagaatga 


1980 


atcagaattc 


aaatatctgc 


ttcactcacg 


gacagaaaga 


ctgtttagag 


tgctttcccg 


2040 


tgtcagaatc 


tcaacccgtt 


tctgtcgtca 


aaaaggcgta 


tcagaaactg 


tgctacattc 


2100 


atcatatcat 


gggaaaggt g 


ccagacgctt 


gcactgcctg 


cgatctggtc 


aatgtggatt 


2160 


tggat gactg 


catctttgaa 


caataaatga 


tttaaatcag 


gtatggctgc 


cgatggttat 


2220 


cttccagatt 


ggctcgagga 


cactctctct 


gaaggaataa 


gacagtggtg 


gaagctcaaa 


2280 


cctggcccac 


caccaccaaa 


gcccgcagag 


cggcataagg 


acgacagcag 


gggtcttgtg 


2340 


cttcctgggt 


acaagtacct 


cggaccc ttc 


aacggactcg 


acaagggaga 


gccggtcaac 


2400 


gaggcagacg 


ccgcggccct 


cgagcacgta 


caaagcctac 


gaccggcagc 


tcgacagcgg 


2460 


agacaacccg 


tacctcaagt 


acaaccacgc 


cgacgcggag 


tttcaggagc 


gccttaaaga 


2520 


agatacgtct 


tttgggggca 


acctcggacg 


agcagtcttc 


caggcgaaaa 


agagggttct 


2580 


tgaacctctg 


ggcctggttg 


aggaacctgt 


taagacggct 


c;cgggaaaaa 


agaggccggt 


2640 


agagcac tct 


cctataaaac 


cagactcctc 


ctcgggaacc 


ggaaaggcgg 


gccagcagcc 


2700 


tgcaagaaaa 


agattgaatt 


ttggtcagac 


tggagacgca 


gactcagtac 


ctgaccccca 


2760 


gcctctcgga 


cagccaccag 


cagccccctc 


tggtctggga 


actaatacga 


tggctacagg 


2820 


cagtggcgca 


ccaatggcag 


acaataacga 


gggcgccgac 


ggagtgggta 


attcctccgg 


2880 


aaattggcat 


tgcgattcca 


catggatggg 


cgacagagtc 


atcaccacca 


gcacccgaac 


2940 


ctgggccctg 


cccacctaca 


acaaccacct 


ctacaaacaa 


atttccagcc 


aatcaggagc 


3000 


ctcgaacgac 


aatcactact 


ttggctacag 


caccccttgg 


gggtattttg 


acttcaacag 


3060 


attccactgc 


cacttttcac 


cacgtgactg 


gcaaagactc 


atcaacaaca 


actggggatt 


3120 


ccgacccaag 


agactcaact 


tcaagctctt 


taacattcaa 


gtcaaagagg 


tcacgcagaa 


3180 


tgacggtacg 


acgacgattg 


ccaataacct 


taccagcacg 


gttcaggtgt 


ttactgactc 


3240 


ggagtaccag 


ctcccgtacg 


tcctcggctc 


ggcgcatcaa 


ggatgcctcc 


cgccgttccc 


3300 


agcagacgtc 


ttcatggtgc 


cacagtatgg 


atacctcacc 


ctgaacaacg 


ggagtcaggc 


3360 


agtaggacgc 


tcttcatttt 


actgcctgga 


gtactttcct 


tctcagatgc 


tgcgtaccgg 


3420 


aaacaacttt 


accttcagct 


acacttttga 


ggacgttcct 


ttccacagca 


gctacgctca 


3480 


cagccagagt 


ctggaccgtc 


tcatgaatcc 


tctcatcgac 


cagtacctgt 


at tact tgag 


u 


cagaacaaac 


actccaagtg 


gaaccaccac 


gcagtcaagg 


cttcagtttt 


ctcaggccgg 


3600 


agcgagtgac 


attcgggacc 


agtctaggaa 


ctggcttcct 


ggaccctgtt 


accgccagca 


3660 


gcgagtatca 


aagacatctg 


cggataacaa 


caacagtgaa 


tactcgtgga 


ctggagctac 


3720 


caagtaccac 


ctcaatggca 


gagactctct 


ggtgaatccg 


gccatggcaa 


gccacaagga 


3780 


cgatgaagaa 


aagttttttc 


ctcagagcgg 


ggttctcatc 


tttgggaagc 


aaggctcaga 


3840 
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gtgaacat tg 


aaaaggtcat 


gattacagac 


gaagaggaaa 


"cc ggaacaac 


3900 


caatcccgtg 


gctacggagc 


agt atggttc 


tgtatctacc 


aacctccaga 


gaggcaacag 


3960 


ac a age a get 


accgcagatg 


t caacacaca 


aggcgt tctt 


ccaggcatgg 


"^ctggcagga 


4 02 0 


cagagatgtg 


tacctt cagg 


ggcccatctg 


ggcaaagatt 


ccacacacgg 


acggacat tt 


4080 


tcacccctct 


cccctcat gg 




acttaaacac 


cctcc tccac 


agattc teat 


4140 


caagaacacc 


ccggtacctg 


cgaat ccttc 


gaccacct tc 


agt gcggcaa 


agtttgct tc 


4200 


ctt catcaca 


cagtac t cca 


cacrcracaccrcf 


tcagcgtgga 


gat cgagtgg 


gage tgcaga 


4260 


aggaaaacag 


caaacgct gg 


aat cccgaaa 


ttcagtacac 


t tccaactac 


aacaagtctg 


4320 


ttaatcgtgg 


acttaccgtg 


gatactaatg 


gcgtgtattc 


agagcctcgc 


cccattggca 


4380 


ccagatacct 


gactcgtaat 


ctgtaattgc 


ttgttaatca 


ataaaccgtt 


taattcgttt 


4440 


cagttgaact 


ttggtctctg 


cgtatttctt 


tcttatctag 


t ttccatggc 


tacgtagata 


4500 


agtagcatgg 


cgggttaatc 


attaactaca 


aggaacccct 


agtgatggag 


ttggccactc 


4560 


cctctctgcg 


cgctcgctcg 


ctcactgagg 


ccgggcgacc 


aaaggtcgcc 


cgacgcccgg 


4620 


gctttgcccg 


ggcggcctca 


gtgagcgagc 


gagcgcgcag 


agagggagtg 


gccaa 


4675 
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ttggccactc 


cctctatgcg 


cactcgctcg 


ctcggtgggg 


cctggcgacc 


aaaggtcgcc 


60 


35 


agacggacgt 


gctttgcacg 


tccggcccca 


ccgagcgagc 


gagtgcgcat 


agagggagtg 


120 




gccaactcca 


tcactagagg 


tatggcagtg 


acgtaacgcg 


aagcgcgcga 


agcgagacca 


180 




cgcctaccag 


ctgcgtcagc 


agtcaggtga 


cccttttgcg 


acagt ttgcg 


acaccacgtg 


240 


40 


gccgctgagg 


gtatatattc 


t cgagtgagc 


gaaccaggag 


ctccattttg 


accgcgaaat 


300 




ttgaacgagc 


agcagceatg 


ccggggttct 


acgagattgt 


cctgaaggtc 


ecgagtgacc 


360 




tggacgageg 


cctgccgggc 


atttctaact 


cgtttgttaa 


ctgggtggcc 


gagaaggaat 


420 


45 


gggacgtgcc 


gccggattct 


gacatggatc 


cgaatctgat 


tgagcaggca 


cccctgaccg 


480 




tggccgaaaa 


gcttcagcge 


gagttcctgg 


tggagtggcg 


ccgcgtgagt 


aaggccecgg 


540 




aggccctctt 


ttttgteeag 


ttegaaaagg 


gggagaccta 


cttccacctg 


eacgtgctga 


600 


50 


ttgagaccat 


eggggtcaaa 


tccatggtgg 


tcggccgcta 


cgtgagccag 


attaaagaga 


660 




agctggtgac 


ccgcatetac 


cgcggggtcg 


agccgcagct 


tccgaactgg 


ttcgcggtga 


720 




ccaaaacgcg 


aaatggcgcc 


gggggcggga 


acaaggtggt 


ggacgactgc 


tacatcccca 


780 


55 


actacctgct 


ccccaagacc 


cagcccgagc 


tccagtgggc 


gtggactaac 


atggaccagt 


840 
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atttaagcgc 


ctgtttgaat 


ctcgcggagc 


gtaaacggct 


ggtg^gcgcag 


catctgacgc 


9O0 


5 


acgtgtcgca 


gacgcaggag 


cagaacaaag 


agaatcagaa 


ccccaattct 


gacgcgccgg 


960 




tcatcaggtc 


aaaaacctca 


gccaggtaca 


tggagctggt 


cgggtggctg 


gtggaccgcg 


1020 




ggatcacgtc 


agaaaagcaa 


tggattcagg 


aggaccaggc 


ctcgtacatc 


tccttcaacg 


1080 


10 


ccgcctccaa 


ctcgcggtcc 


cagatcaagg 


ccgcgctgga 


caatgcctcc 


aagatcatga 


1140 




gcctgacaaa 


gacggctccg 


gactacctgg 


tgggcagcaa 


cccgccggag 


gacattacca 


1200 




aaaatcggat 


ctaccaaatc 


ctggagctga 


acgggtacga 


tccgcagtac 


gcggcctccg 


1260 


15 


tcttcctggg 


ctgggcgcaa 


aagaagttcg 


ggaagaggaa 


caccatctgg 


ctctttgggc 


1320 




cggccacgac 


gggtaaaacc 


aacatcgcgg 


aagccatcgc 


ccacgccgtg 


cccttctacg 


1380 




gctgcgtaaa 


ctggaccaat 


gagaactttc 


ccttcaacga 


ttgcgtcgac 


aagatggtga 


1440 


20 


tctggtggga 


ggagggcaag 


atgacggcca 


aggtcgtgga 


gagcgccaag 


gccattctgg 


1500 




gcggaagcaa 


ggtgcgcgtg 


gaccaaaagt 


gcaagtcatc 


ggcccagatc 


gaacccactc 


1560 




ccgtgatcgt 


cacctccaac 


accaacatgt 


gcgccgtgat 


tgacgggaac 


agcaccacct 


1620 


25 


tcgagcatca 


gcagccgctg 


caggaccgga 


tgtttgaatt 


tgaacttacc 


cgccgtttgg 


1680 




accatgactt 


tgggaaggtc 


accaaacagg 


aagtaaagga 


ctttttccgg 


tgggcttccg 


1740 




atcacgtgac 


tgacgtiggct 


catgagttct 


acgtcagaaa 


gggtggagct 


aagaaacgcc 


1800 


30 


ccgcctccaa 


tgacgcggat 


gtaagcgagc 


caaaacggga 


gtgcacgtca 


cttgcgcagc 


1860 




cgacaacgtc 


agacgcggaa 


gcaccggcgg 


actacgcgga 


caggtaccaa 


aacaaatgtt 


1920 




ctcgtcacgt 


gggcatgaat 


ctgatgcttt 


ttccctgtaa 


aacatgcgag 


agaatgaatc 


1980 


35 


aaatttccaa 


tgtctgtttt 


acgcatggtc 


aaagagactg 


tgcggaatgc 


ttccctggaa 


2040 




tgtcagaatc 


tcaacccgtt 


tctgtcgtca 


aaaagaagac 


ttatcagaaa 


ctgtgtccaa 


2100 




ttcatcatat 


cctgggaagg 


gcacccgaga 


ttgcctgttc 


ggcctgcgat 


ttggccaatg 


2160 


40 


tggacttgga 


tgactgtgtt 


tctgagcaat 


aaatgactta 


aaccaggtat 


ggctgctgac 


2220 




ggttatcttc 


cagattggct 


cgaggacaac 


ctttctgaag 


gcattcgtga 


gtggtgggct 


2280 




ctgaaacctg 


gagtccctca 


acccaaagcg 


aaccaacaac 


accaggacaa 


ccgtcggggt 


2340 


45 


cttgtgcttc 


cgggttacaa 


atacctcgga 


cccggtaacg 


gac tcgacaa 


aggagagccg 


2400 




gtcaacgagg 


cggacgcggc 


agccctcgaa 


cacgacaaag 


cttacgacca 


gcagctcaag 


2460 




gccggtgaca 


acccgtacct 


caagtacaac 


cacgccgacg 


ccgagtttca 


ggagcgtctt 


2520 


50 


caagaagata 


cgtcttttgg 


gggcaacctt 


ggcagagcag 


tcttccaggc 


caaaaagagg 


2580 




atccttgagc 


ctcttggtct 


ggttgaggaa 


gcagctaaaa 


cggctcctgg 


aaagaagggg 


2640 




gctgtagatc 


agtctcctca 


ggaaccggac 


tcatcatctg 


gtgttggcaa 


atcgggcaaa 


2700 


55 


cagcctgcca 


gaaaaagact 


aaatttcggt 


cagactggag 


actcagagtc 


agtcccagac 


2760 
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cctcaacctc 


tcggagaacc 


accagcagcc 


cccacaagtt 


tgggatctaa 


tacaatggct 


2820 


5 


tcaggcggtg 


gcgcaccaat 


ggcagacaat 


aacgagggtg 


ccgatggagt 


gggtaattcc 


2880 




tcaggaaatt 


ggcattgcga 


ttcccaatgg 


ctgggcgaca 


gagtcatcac 


caccagcacc 


2940 




agaacctggg 


ccctgcccac 


ttacaacaac 


catctctaca 


agcaaatctc 


cagccaatca 


3000 


10 


ggagcttcaa 


acgacaacca 


ctactttggc 


tacagcaccc 


cttgggggta 


ttttgacttt 


3060 




aacagattcc 


actgccactt 


ctcaccacgt 


gactggcagc 


gactcattaa 


caacaactgg 


3120 




ggattccggc 


ccaagaaact 


cagcttcaag 


ctcttcaaca 


tccaagttag 


aggggtcacg 


3180 


15 


cagaacgatg 


gcacgacgac 


tattgccaat 


aaccttacca 


gcacggttca 


agtgtttacg 


3240 




gactcggagt 


atcagctccc 


gtacgtgctc 


gggtcggcgc 


accaaggctg 


tctcccgccg 


3300 




tttccagcgg 


acgtcttcat 


ggtccctcag 


tatggatacc 


tcaccctgaa 


caacggaagt 


3360 


20 


caagcggtgg 


gacgctcatc 


cttttactgc 


ctggagtact 


tcccttcgca 


gatgctaagg 


3420 




actggaaata 


acttccaatt 


cagctatacc 


ttcgaggatg 


taccttttca 


cagcagctac 


3480 




gctcacagcc 


agagtttgga 


tcgcttgatg 


aatcctctta 


ttgatcagta 


tctgtactac 


3540 


25 


ctgaacagaa 


cgcaaggaac 


aacctctgga 


acaaccaacc 


aatcacggct 


gctttttagc 


3600 




caggctgggc 


ctcagtctat 


gtctttgcag 


gccagaaatt 


ggctacctgg 


gccctgctac 


3660 




cggcaacaga 


gactttcaaa 


gactgctaac 


gacaacaaca 


acagtaactt 


tccttggaca 


3720 


30 


gcggccagca 


aatatcatct 


caatggccgc 


gactcgctgg 


tgaatccagg 


accagctatg 


3780 




gccagtcaca 


aggacgatga 


agaaaaattt 


ttccctatgc 


acggcaatct 


aatatttggc 


3840 




aaagaaggga 


caacggcaag 


taacgcagaa 


ttagataatg 


taatgattac 


ggatgaagaa 


3900 


35 


gagattcgta 


ccaccaatcc 


tgtggcaaca 


gagcagtatg 


gaactgtggc 


aaataacttg 


3960 




cagagctcaa 


atacagctcc 


cacgactgga 


actgtcaatc 


atcagggggc 


cttacctggc 


4020 




atggtgtggc 


aagatcgtga 


cgtgtacctt 


caaggaccta 


tctgggcaaa 


gattcctcac 


4080 


40 


acggatggac 


actttcatcc 


ttctcctctg 


atgggaggct 


ttggactgaa 


acatccgcct 


4140 




cctcaaatca 


tgatcaaaaa 


tactccggta 


ccggcaaatc 


ctccgacgac 


tttcagcccg 


4200 




gccaagtttg 


cttcatttat 


cactcagtac 


tccactggac 


aggtcagcgt 


gga.aattgag 


4260 


45 


tgggagctac 


agaaagaaaa 


cagcaaacgt 


tggaatccag 


agattcagt a 


cacttccaac 


4320 




tacaacaagt 


ctgttaatgt 


ggactttact 


gtagacacta 


atggtgttta 


tagtgaacct 


4380 




cgccctattg 


gaacccggta 


tctcacacga 


aacttgtgaa 


tcctggttaa 


tcaataaacc 


4440 


50 


gtttaattcg 


tttcagttga 


actttggctc 


ttgtgcactt 


ctttatcttt 


atcttgtttc 


4500 




catggctact 


gcgt agataa 


gcagcggcct 


gcggcgcttg 


cgcttcgcgg 


tttacaactg 


4560 




ctggttaata 


tttaactctc 


gccatacctc 


tagtgatgga 


gttggccact 


ccctctatgc 


4620 


55 


gcactcgctc 


gctcggtggg 


gcctggcgac 


caaaggtcgc 


cagacggacg 


tgctttgcac 


4680 
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55 



gtccggcccc 


accgagcgag 


cgagtgcgca 


tagagggagt 


ggccaa 




4726 


<210> 9 
<211> 3098 
<212> DNA 
<213> new 


AAV serotype, clone 42.2 








<400> 9 
gaattcgccc 


tttctacggc 


tgcgtcaact 


ggaccaatga 


gaactttccc 


ttcaacgatt 


60 


gcgtcgacaa 


gatggtgatc 


tggtgggagg 


agggcaagat 


gacggccaag 


gtcgtggagt 


12 0 


ccgccaaggc 


cattctcggc 


ggcagcaagg 


tgcgcgtgga 


ccaaaagtgc 


aagtcttccg 


18 0 


cccagatcga 


tcccaccccc 


gtgatcgtca 


cttccaacac 


caacatgtgc 


gctgtgattg 


240 


acgggaacag 


caccaccttc 


gagcaccagc 


agccgttaca 


agaccggatg 


ttcaaatttg 


300 


aactcacccg 


ccgtctggag 


cacgactttg 


gcaaggtgac 


aaagcaggaa 


gtcaaagagt 


360 


tcttccgctg 


ggcgcaggat 


cacgtgaccg 


aggtggcgca 


tgagttctac 


gtcagaaagg 


420 


gtggagccaa 


caagagaccc 


gcccccgatg 


acgcggataa 


aagcgagccc 


aagcgggcct 


480 


gcccctcagt 


cgcggatcca 


tcgacgtcag 


acgcggaagg 


age tccggtg 


gactttgccg 


54 0 


acaggtacca 


aaacaaatgt 


tctcgtcacg 


cgggcatgct 


tcagatgctg 


tttccctgca 


600 


agacatgcga 


gagaatgaat 


cagaatttca 


acatttgctt 


cacgcacggg 


accagagact 


660 


gttcagaatg 


tttccccggc 


gtgtcagaat 


ctcaaccggt 


cgtcagaaag 


aggacgtatc 


720 


ggaaactctg 


tgccattcat 


catctgctgg 


ggcgggctcc 


cgagattgct 


tgctcggcct 


780 


gcgatctggt 


caacgtggac 


ctggatgacc 


gtgttt ctga 


gcaataaatg 


acttaaacca 


840 


ggtatggctg 


ccgatggtta 


tcttccagat 


tggctcgagg 


acaacctctc 


tgagggcatt 


900 


cgcgagtggt 


gggacttgaa 


acctggagcc 


ccgaaaccca 


aagccaacca 


gcaaaagcag 


960 


gacgacggcc 


ggggtctggt 


gcttcctggc 


tacaagtacc 


tcggaccctt 


caacggactc 


1020 


gacaagggag 


agccggtcaa 


cgaggcagac 


gccgcggccc 


tcgagcacga 


caaggcctac 


1080 


gacaagcagc 


tcgagcaggg 


ggacaacccg 


tacctcaagt 


acaaccacgc 


cgacgccgag 


1140 


tttcaggagc 


gtcttcaaga 


agatacgtct 


tttgggggca 


acctcgggcg 


agcagtcttc 


1200 


caggccaaga 


agcgggttct 


cgaacctctc 


ggtctggttg 


aggaaggcgc 


taagacggct 


1260 


cctggaaaga 


agagacccat 


agaatccccc 


gactcctcca 


cgggcatcgg 


caagaaaggc 


1320 


cagcagcccg 


ctaaaaagaa 


gctcaacttt 


gggcagactg 


gcgactcaga 


gtcagtgccc 


1380 


gacccccaac 


ctctcggaga 


acctcccgcc 


gcgccctcag 


gtctgggatc 


i:;ggtacaatg 


1440 


gctgcaggcg 


gtggcgcacc 


aatggcagac 


aataacgaag 


gcgccgacgg 


agtgggtaat 


1500 


gcctccggaa 


attggcattg 


cgattccaca 


tggctgggcg 


acagagtcat 


caccaccagc 


1560 


acccgcacct 


gggccctgcc 


cacctacaac 


aaccacctct 


acaagcagat 


atcaagtcag 


1620 


agcggggcta 


ccaacgacaa 


ccacttcttc 


ggctacagca 


ccccctgggg 


ctattttgac 


1680 
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ttcaacagat 


tccactgcca 


cttctcacca 


cgtgactggc 


agcgactcat 


caacaacaac 


1740 


tggggattcc 


ggcccagaaa 


gctgcggttc 


aagttgttca 


acatccaggt 


caaggaggtc 


1800 


acgacgaacg 


acggcgttac 


gaccatcgct 


aataacctta 


ccagcacgat 


tcaggtcttc 


1860 


tcggactcgg 


agtaccaact 


gccgtacgtc 


ctcggctctg 


cgcaccaggg 


ctgcctccct 


1920 


ccgttccctg 


cggacgtgtt 


catgattcct 


cagtacggat 


atctgactct 


aaacaacggc 


1980 


agtcagtct g 


tgggacgt tc 


ctccttctac 


tgcctggagt 


acttt cctt c 


tcagatgctg 


2040 


agaacgggca 


ataactttga 


att cagctac 


accttt gagg 


aagtgcctt t 


ccacagcagc 


2100 


tatgcgcaca 


gccagagcct 


ggaccggctg 


atgaatcccc 


tcatcgacca 


gtacctgt ac 


2160 


tacctggccc 


gqacccagag 


cactacgggq 
^^^^ 


tccacaaggg 


age tgcagtt 


ccat caggct 


2220 


gggcccaaca 


ccatggccga 


gcaatcaaag 


aactggctgc 


ccggaccctg 


ttat cggcag 


2280 


cagagactgt 


caaaaaacat 


agacagcaac 


aacaacagta 


actttgcctg 


gaccggggcc 


2340 


actaaatacc 


atctgaatgg 


tagaaattca 


ttaaccaacc 


cgcgcgtagc 


catggccacc 


2400 


aacaaggacg 


acgaggacca 


gttctttccc 


atcaacggag 


tgctggtttt 


tggcgaaacg 


2460 


ggggctgcca 


acaagacaac 


gctggaaaac 


gtgctaatga 


ccagccragcra 


ggagatcaaa 


252 0 


accaccaatc 


ccgtggctac 


agaagaatac 


ggtgtggtct 


ccagcaacct 


gcaatcgtct 


2580 


acggccggac 


cccagacaca 


gactgtcaac 


agccaggggg 


etc tgcccgg 


catggtctgg 


2640 


cagaaccggg 


acgtgtacct 


gcagggtccc 


atctgggcca 


aaattcctca 


cacggacggc 


2700 


aactttcacc 


cgtctcccct 


gatgggcgga 


tttggactca 


aacacccgcc 


tcctcaaatt 


2760 


c'bca'tcaaaa 


acaccccggt: 


acc tgctaat 


cctccagagg 


t gtttactcc 


tgccaagttt 


282 0 


gcctcattta 


tcacgcagta 


cagcaccggc 


caggtcagcg 


tggagatcga 


gtgggaactg 


2880 


cagaaagaaa 


acagcaaacg 


ctggaatcca 


gagattcagt 


acacctcaaa 


ttatgccaag 


2940 


tctaataatg 


tggaatttgc 


tgtcaacaac 


gaaggggttt 


atactgagcc 


tcgccccatt 


3000 


ggcacccgtt 


acctcacccg 


taacctgtaa 


ttgcctgtta 


atcaataaac 


cggttaattc 


3060 


gtttcagttg 


aactttggtc 


tctgcgaagg 


gcgaattc 






3098 



40 



45 <210> 10 

<211> 3098 
<212> DNA 

<213> new AAV serotype, clone 16.3 

<400> 10 

50 gaattcgccc ttcgcagaga ccaaagttca actgaaacga atcaaccggt ttattgatta 60 

acaagtaatt acaggttacg ggtgaggtaa cgggtgccaa tggggcgagg ctcagtataa 120 
accccttcgt tgttgacagc aaattccaca ttattagact tggcataatt tgaggtgtac 180 

55 tgaatctctg gattccagcg tttgctgttt tctttctgca gttcccactc gatctccacg 240 
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ctgacctggc 


cggtgctgta 


ctgcgtgata 


aat:gaggcaa 


actaggcagg 


agtaaacacc 


300 


5 


cctggaggat 


tagcaggtac 


cggggtgttt 


ttgatgagaa 


tttgaggagg 


cgggtgtttg 


360 




agtccaaatc 


cgcccatcag 


gggagacggg 


tgaaagttgc 


cgtccgtgtg 


aggaattttg 


420 




gcccagatgg 


gaccctgcag 


gtacacgtcc 


cggttctgcc 


agaccatgcc 


gggcagagcc 


480 


10 


ccctggctgt 


tgacagtctg 


tgtctggggt 


ccggccgtag 


accattgcag 


gttgctggag 


540 




accacaccgt 


attcttctgt 


agccacggga 


ttggtggttt 


tgatctcctc 


ctcgctggtc 


600 




attagcacgt 


tttccagcgt 


tgtcttgttg 


gcagcccccg 


ttttgccaaa 


aaccagcact 


660 


15 


ccgttgatgg 


gaaagaactg 


gccctcgtcg 


tccttgttgg 


tggccatggc 


tacgcccggg 


720 




ttggttaatg 


aatttctacc 


attcagatgg 


tatttagtgg 


ccccggtcca 


ggcaaagtta 


780 




ctgttgttgt 


tgctgtctat 


gttttttgac 


agtctctgct 


gccgataaca 


gggtccgggc 


840 


20 


agccagttct 


ttgattgctc 


ggccatggtg 


ttgggcccag 


cctgatggaa 


ctgcagctcc 


900 




cttgtggacc 


ccgtagtgct 


ctgggtccgg 


gccaggtagt 


acaggtactg 


gtcgatgagg 


960 




ggattcatca 


gccggtccag 


gctctggctg 


tgcgcatagc 


tgctgtggaa 


aggcacttcc 


1020 


25 


tcaaaggtgt 


agctgaattc 


aaagttattg 


cccgttctca 


gcatctgaga 


aggaaagtac 


1080 




tccaggcagt 


agaaggagga 


acgtcccata 


gactgactgc 


cgttgtttag 


agtcagatat 


1140 




ccgtactgag 


gaatcatgaa 


cacgtccgca 


gggaacggag 


ggaggcagcc 


ctggtgcgca 


1200 


30 


gagccgagga 


cgtacggcag 


ttggtactcc 


gagtccgaga 


agacctgaat 


cgtgctggta 


1260 




aggttattag 


cgatggtcgt 


aacgccgtcg 


ttcgtcgtga 


cctccttgac 


ctggatgttg 


1320 




aacaacttga 


accgcagctt 


tctgggccgg 


aatccccagt 


tgttgttgat 


gagtcgctgc 


1380 


35 


cagtcacgtg 


gtgagaagtg 


gcagtggaat 


ctgttgaagt 


caaaatagcc 


ccagggggtg 


1440 




ctgtagccga 


agaagtggtt 


gtcgttggta 


gccccgctct 


gacttgatat 


ctgcttgtag 


1500 




aggtggttgt 


tgtaggtggg 


cagggcccag 


gtgcgggtgc 


tggtggtgat 


gactctgtcg 


1560 


40 


cccagccatg 


tggaatcgca 


atgccaattt 


ccggaggcat 


tacccactcc 


gtcggcgcct 


1620 




tcgttattgt 


ctgccattgg 


tgcgccaccg 


cctgcagcca 


ttgtaccaga 


tcccagacct 


1680 




gagggcgcgg 


cgggaggttc 


tccgagaggt 


tgggggtcgg 


gcactgact c 


tgagtcgcca 


1740 


45 


gtctgcccaa 


agttgagctt 


ctttttagcg 


ggctgctggc 


ctttcttgcc 


gatgcccgtg 


1800 




gaggagtcgg 


gggattctat 


gggtctcttc 


tttccaggag 


ccgtcttagc 


cccttcctca 


1860 




accagaccga 


gaggttcgag 


aacccgctt c 


ttggcctgga 


agactgctcg 


cccgaggttg 


1920 


50 


c ccccaaaag 


acgtatcttc 


t tgaagacgc 


tcct gaaact 


cagcgtcggc 


gtggtt gtac 


1980 




t tgaggtacg 


ggttgtcccc 


ctgctcgagc 


tgcttgtcgt 


aggccttgtc 


gt gc tcgagg 


2040 




gccgcggcgt 


ctgcctcgtt 


gaccggctct 


cccttgtcga 


gtccgttgaa 


gggtccgagg 


2100 


55 


tacttgtagc 


caggaagcac 


cagaccccgg 


ccgtcgtcct 


gcttttgctg 


gttggctttg 


2160 
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ggtttcgggg 


ctccaggttt 


caagtcccac 


cactcgcgaa 


tgccctcaga 


gaggttgtcc 


2220 


5 


tcgagccaat 


ctggaagata 


accatcggca 


gccatacctg 


gtttaagtca 


tttattgctc 


2280 




agaaacacag 


tcatccaggt 


ccacgttgac 


cagatcgcag 


gccgagcaag 


caatctcggg 


2340 




agcccgcccc 


agcagatgat 


gaatggcaca 


gagtttccga 


tacgtcctct 


ttctgacgac 


2400 


10 


cggttgagat 


tctgacacgc 


cggggaaaca 


ttctgaacag 


tctctggtcc 


cgtgcgtgaa 


2460 




gcaaatgttg 


aaattctgat 


tcattctctc 


gcatgtcttg 


cagggaaaca 


gcatctgaag 


2520 




catgcccgcg 


tgacgagaac 


atttgttttg 


gtacctgtcg 


gcaaagtcca 


ccggagctcc 


2580 


15 


ttccgcgtct 


gacgtcgatg 


gatccgcgac 


tgaggggcag 


gcccgcttgg 


gctcgctttt 


2640 




atccgcgtca 


tcgggggcgg 


gcctcttgtt 


ggctccaccc 


tttcjtgacgt 


agaactcatg 


2700 




cgccacctcg 


gtcacgtgat 


cctgcgccca 


gcggaagaac 


tctttgactt 


cctgct ttgt 


2760 


20 


caccttgcca 


aagtcctgct 


ccagacggcg 


ggtgagttca 


aatttgaaca 


tccggtcttg 


2820 




taacggctgc 


tggtgctcga 


aggtggtgct 


gttcccgtca 


atcacggcgc 


acatgttggt 


2880 




gttggaagtg 


acgatcacgg 


gggtgggatc 


gatctgggcg 


gacgacttgc 


acttttggtc 


2940 


25 


cacgcgcacc 


ttgctgccgc 


cgagaatggc 


cttggcggac 


tccacgacct 


tggccgtcat 


3000 




cttgccctcc 


tcccaccaga 


tcaccatctt 


gtcgacgcaa 


tcgttgaagg 


gaaagttctc 


3060 




attggtccag 


ttgacgcagc 


cgtagaaagg 


gcgaattc 






3098 



<210> 11 

<211> 3121 

<212> DNA 

<213> new AAV serotype, clone 2 9.3 

<400> 11 



40 



45 



50 



gaattcgccc 


ttcgcagaga 


ccaaagttca 


ac tgaaac ga 


atcaaccggt 


ttattgatta 


60 


acaagcaatt 


acagattacg 


ggtgaggtaa 


cgggtgccga 


tggggcgagg 


ctcagaataa 


120 


gtgccatctg 


tgttaacagc 


aaagtccaca 


tttgtagatt 


tgtagtagtt 


ggaagtgtat 


180 


tgaatctctg 


ggttccagcg 


tttgctgttt 


tctttctgca 


gctcccattc 


aatttccacg 


240 


ctgacctgtc 


cggtgctgta 


ctgcgtgatg 


aacgacgcca 


gcttagcttg 


actgaaggta 


300 


gttggaggat 


ccgcgggaac 


aggtgtattc 


ttaatcagga 


tctgaggagg 


cgggtgtttc 


360 


agtccaaagc 


cccccatcag 


cggcgaggga 


tgaaagtttc 


cgtccgtgtg 


aggaatcttg 


420 


gcccagatag 


gaccctgcag 


gtacacgtcc 


cggttctgcc 


agaccatgcc 


aggtaaggct 


480 


ccttgactgt 


tgacggcccc 


tacaatagga 


gcggcgtttt 


gctgttgcag 


gttatcggcc 


540 


accacgccgt 


actgttctgt 


ggccactggg 


ttggtggttt 


taatttcttc 


ctcactggtt 


600 


agcataacgc 


tgctatagtc 


cacgttgcct 


tttccagctc 


cctgtttccc 


aaacattaag 


660 


actccgctgg 


acggaaaaaa 


tcgctcttcg 


tcgtccttgt 


gggttgccat 


agcgacaccg 


720 


ggatttacca 


gagagtctct 


gccattcaga 


tgatacttgg 


tggcaccggt 


ccaggcaaag 


780 
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ttgctgttgt 


tattttgcga 


cagtgtcgtg 


gagacgcgtt 


gctgccggta 


gcagggcccg 


840 


ggtagccagt 


ttttggcctg 


agccgacatg 


ttattaggcc 


cggcctgaga 


aaatagcaac 


900 


tgctgagttc 


ctgcggtacc 


tcccgtggac 


tgagtccgag 


acaggtagta 


caggtactgg 


960 


tcgatgaggg 


ggttcatcag 


ccggtccagg 


ctttggctigt 


gcgcgtagct 


gctgtgaaaa 


1020 


ggcacgtcct 


caaactggta 


gctgaactca 


aagttgttgc 


ccgttctcag 


catttgagaa 


1080 


ggaaagtact 


ccaggcagta 


gaaggaggaa 


cggcccacgg 


cctgactgcc 


attgttcaga 


1140 


gtcaggtacc 


cgtactgagg 


aatcatgaag 


acgtccgccg 


ggaacggagg 


caggcagccc 


1200 


t ggcgcgcag 


agccgaggac 


gtacgggagc 


tggtattccg 


agtccgtaaa 


gacctgaatc 


1260 


gtgctggtaa 


ggttattggc 


gatggtcttg 


gtgccttcat 


tctgcgtgac 


ctccttgacc 


1320 


tggatgttga 


agagcttgaa 


gtt gagtctc 


ttgggccgga 


atccccagt t 


gttg ttgatg 


1380 


agtcgctgcc 


agtcacgtgg 


tgagaagtgg 


cagtggaatc 


tgttaaagtc 


aaaatacccc 


1440 


cagggggtgc 


tgtagccgaa 


gtaggtgttg 


tcgttggtgc 


ttcctcccga 


agtcccgttg 


1500 


gagatttgct 


tgtagaggtg 


gttgttgtag 


gtggggaggg 


cccaggttcg 


ggtgctggtg 


1560 


gtgatgactc 


tgtcgcccag 


ccatgtggaa 


tcgcaatgcc 


aatttcctga 


ggaactaccc 


1620 


actccgtcgg 


cgccttcgtt 


attgtctgcc 


attggagcgc 


caccgcctgc 


agccattgta 


1680 


ccagatccca 


gaccagaggg 


gcctgcgggg 


ggttctccga 


t tggttgagg 


gtcgggcact 


1740 


gactctgagt 


cgccagtctg 


cccaaagttg 


agtctctttt 


tcgcgggctg 


ctggcctttc 


1800 


ttgccgatgc 


ccgtagtgga 


gtctggagaa 


cgctggggtg 


atggctctac 


cggtctcttc 


1860 


tttccaggag 


ccgtcttagc 


gccttcctca 


accagaccga 


gaggttcgag 


aacccgcttc 


1920 


ttggcctgga 


agactgctcg 


t ccgaggttg 


cccccaaaag 


acgtatcttc 


ttgcagacgc 


1980 


tcctgaaact 


cggcgtcggc 


gtggttatac 


cgcaggtacg 


gattgtcacc 


cgctttgagc 


2040 


tgctggtcgt 


aggccttgtc 


gtgctcgagg 


gccgctgcgt 


ccgccgcgtt 


gacgggctcc 


2100 


cccttgtcga 


gtccgttgaa 


gggtccgagg 


tacttgtagc 


caggaagcac 


cagaccccgg 


2160 


ccgtcgtcct 


gcttttgctg 


gttggctttg 


ggcttcgggg 


ctccaggttt 


cagcgcccac 


2220 


cactcgcgaa 


tgccctcaga 


gaggttgtcc 


tcgagccaat 


ctggaagata 


accatcggca 


2280 


gccatacctg 


atctaaatca 


tttattgttc 


aaagatgcag 


tcatccaaat 


ccacattgac 


2340 


cagatcgcag 


qcaatQcaaa 


cgtctggcac 


ctttcccatg 


atatgatgaa 


tgtagcacag 


2400 


tttctgatac 


gcctttttga 


cgacagaaac 


gggttgagat 


tctgacacgg 


gaaagcactc 


2460 


taaacagtct 


ttctgtccgt 


gagtgaagca 


gatatttgaa 


ttctgattca 


ttctctcgca 


2520 


ttgtctgcag 


ggaaacagca 


tcagattcat 


gcccacgtga 


cgagaacatt 


tgttttggta 


2580 


cctgtccgcg 


tagttgatcg 


aagcttccgc 


gtctgacgtc 


gatggctgcg 


caactgactc 


2640 


gcgcacccgt 


ttgggctcac 


ttatatctgc 


gtcactgggg 


gcgggtcttt 


tcttggctcc 


2700 
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accctttttg acgtagaatt catgctccac 

5 

aaagtctttg acttcctgct tggtgacctt 
ttcaaatttg aacatccggt cttgcaacgg 
gtcaatcacg gcgcacatgt tggtgttgga 

10 

ggccgaggac ttgcatttct ggtccacgcg 
cgactccacg accttggcgg tcatcttccc 
acagtcgttg aagggaaagt tctcattggt 

15 

c 

<210> 12 
<211> 3121 
20 <212> DNA 

<213> new AAV serotype^ clone 2! 

<400> 12 





gaattcgccc 


ttctacggct 


gcgtcaactg 


25 


tgtcgacaag 


atggtgatct 


ggtgggagga 




ggccaaagcc 


attctcggag 


gaagcaaggt 




ccagatagac 


ccgactcccg 


tgatcgtcac 


30 


cgggaactca 


acgaccttcg 


aacaccagca 




actcacccgc 


cgtctggatc 


atgactttgg 




tttccggtgg 


gcaaaggatc 


acgtggttga 


35 


tggagccaag 


aaaagacccg 


cccccagtga 




cgagtcagtt 


gcgcagccat 


cgacgtcaga 




gtaccaaaac 


aaatgttctc 


gtcacgcggg 


40 


atgcgagaga 


atgaatcaga 


attcaaatat 




agagtgcttt 


cccgtgtcag 


aatctcaacc 




actgtgctac 


attcatcata 


tcatgggaaa 


45 


ggtcgatgtg 


gatttggatg 


actgcatctt 




c tgccgatgg 


ttatcttcca 


gattggctcg 




ggtgggcgct 


gaaacctgga 


gccccgaagc 


50 


gccggggtct 


ggtgcttcct 


ggctacaagt 




gggagcccgt 


caacgcggcg 


gacgcagcgg 




agctcaaagc 


gggtgacaat 


ccgtacctgc 


55 


agcgtctgca 


agaagatacg 


tcttttgggg 



ctcaaccacg tgatcctttg cccaccggaa 2760 

cccaaagtca tgatccagac ggcgggtgag 2820 

ctgctggtgt tcgaaggtcg ttgagttccc 2880 

ggtgacgatc acgggagtcg ggtctatctg 2940 

caccttgctt cctccgagaa tggctttggc 3000 

ctcctcccac cagatcacca tcttgtcgac 3060 

ccagttgacg cagccgtaga agggcgaatt 3120 

3121 

gaccaatgag aactttccct tcaacgactg 60 

ggggaagatg accgccaagg tcgtggagtc 120 

gcgcgtggac cagaaatgca agtcctcggc 180 

ctccaacacc aacatgtgcg ccgtgattga 240 

gccgttgcaa gaccggatgt tcaaatttga 300 

gaaggtcacc aagcaggaag tcaaagactt 360 

ggtggagcac gaattctacg tcaaaaaggg 42 0 

cgcagatata agtgagccca aacgggtgcg 480 

cgcggaagct tcgatcaact acgcagacag 540 

catgaatctg atgctgtttc cctgcagaca 600 

ctgcttcact cacggacaga aagactgttt 660 

cgtttctgtc gtcaaaaagg cgtatcagaa 720 

ggtgccagac gcttgcactg cctgcgatct 780 

tgaacaataa atgatttaaa tcaggtatgg 840 

aggacaacct ctctgagggc attcgcgagt 900 

ccaaagccaa ccagcaaaag caggacggcg 960 

acctcggacc cttcaacgga ctcgacaagg 1020 

ccctcgagca cgacaaggcc tacgaccagc 1080 

ggtataacca cgccgacgcc gagtttcagg 1140 

gcaacctcgg gcgagcagtc ttccaggcca 1200 
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agaagcgggt 


tctcgaacct 


ctcggtctgg 


ttgaggaagg 


cgctaagacg 


gctcctggaa 


1260 


5 


agaagagacc 


ggtagagcca 


tcaccccagc 


gttctccaga 


ctcctctacg 


ggcatcggca 


1320 




agaaaggcca 


gcagcccgcg 


aaaaagagac 


tcaactttgg 


gcagactggc 


gactcagagt 


1380 




cagtgcccga 


ccctcaacca 


atcggagaac 


cccccgcagg 


cccctctggt 


ctgggatctg 


1440 


10 


gtacaatggc 


tgcaggcggt 


ggcgctccaa 


tggcagacaa 


taacgaaggc 


gccgacggag 


1500 




tgggtagttc 


ctcaggaaat 


tggcattgcg 


attccacatg 


gctgggcgac 


tgagtcatca 


1560 




ccaccagcac 


ccgaacctgg 


gccctcccca 


cctacaacaa 


ccacctctac 


aagcaaatct 


1620 


15 


ccaacgggac 


ttcgggagga 


agcaccaacg 


acaacaccta 


cttcggctac 


agcaccccct 


1680 




gggggtattt 


tgactttaac 


agattccact 


gccacttctc 


accacgtgac 


tggcagcgac 


1740 




tcatcaacaa 


caactgggga 


ttccggccca 


agagactcaa 


cttcaagctc 


ttcaacatcc 


1800 


20 


aggtcaagga 


ggtcacgcag 


aatgaaggca 


ccaagaccat 


cgccaataac 


cttaccagca 


1860 




cgattcaggt 


ctttacggac 


tcggaatacc 


agctcccgta 


cgtcctcggc 


tctgcgcacc 


1920 




agggctgcct 


gcctccgttc 


ccggcggacg 


tcttcatgat 


tcctcagtac 


gggtacctga 


1980 


25 


ctctgaacaa 


tggcagtcag 


gccgtgggcc 


gttcctcctt 


ctactgcctg 


gagtactttc 


2040 




cttctcaaat 


gctgagaacg 


ggcaacaact 


ttgagttcag 


ctaccagttt 


gaggacgtgc 


2100 




cttttcacag 


cagctacgcg 


cacagccaaa 


gcctggaccg 


gctgatgaac 


cccctcatcg 


2160 


30 


accagtacct 


gtactacctg 


tctcggactc 


agtccacggg 


aggtaccgca 


ggaactcagc 


2220 




agttgctatt 


ttctcaggcc 


gggcctaata 


acatgtcggc 


tcaggccaaa 


aactggctac 


2280 




ccgggccctg 


ctaccggcag 


taacgcgtct 


ccacgacact 


gtcgcaaaat 


aacaacagca 


2340 


35 


actttgtctg 


gaccggtgcc 


accaagtatc 


atctgaatgg 


cagagactct 


ctggtagatc 


2400 




ccggtgtcgc 


tatggcaacc 


cacaaggacg 


acgaagagcg 


attttttccg 


tccagcggag 


2460 




tcataatgtt 


tgggaaacag 


ggagctggaa 


aagacaacgt 


ggactatagc 


agcgtcatgc 


2520 


40 


taaccagtga 


ggaagaaatt 


aaaaccacca 


acccagtggc 


cacagaacag 


tacggcgtgg 


2580 




tggccgataa 


cctgcaacag 


caaaacgccg 


ctcctattgt 


aggggccgtc 


aacagtcaag 


2640 




gagccttacc 


tggcatggtc 


tggcagaacc 


gggacgtgta 


cctgcagggt 


cctacctggg 


2700 


45 


ccaagattcc 


tcacacggac 


ggaaactttc 


atccctcgcc 


gctgatggga 


ggctttggac 


2760 




tgaaacaccc 


gcctcctcag 


atcctgatta 


agaatacacc 


tgttcccgcg 


gatcctccaa 


2820 




ctaccttcag 


tcaagctaag 


ctggcgtcgt 


tcatcacgca 


gtacagcacc 


ggacaggtca 


2880 


50 


gcgtggaaat 


tgaatgggag 


ctgcaggaag 


aaaacagcaa 


acgctggaac 


ccagagattc 


2940 




aatacacttc 


caactactac 


aaatctacaa 


atgtggactt 


tgctgttaac 


acagatggca 


3000 




cttattctga 


gcctcgcccc 


atcggcaccc 


gttacctcac 


ccgtaatctg 


taattgcttg 


3060 


55 


ttaatcaata 


aaccggttga 


ttcgtttcag 


ttgaactttg 


gtctctgcga 


agggcgaatt 


3120 
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<210> 13 
<211> 3121 
<212> DNA 

<213> new AAV serotype, clone 29.5 








<400> 13 
gaattcgccc 


ttcgcgagac 


caaagttcaa 


ctgaaacgaa 


tcaaccggtt 


tattgattaa 


60 


Cciagcciatt cl 


cagattacgg 


gtgaggtaac 


craataccaat 


ggggcgaggc 


tcagaataag 


120 


t gccatct cft 


gt taacagca 


aag t ccacat 


ttgtagattt 


gtagtagttg 


gaagtgtatt 


180 


gaatctc t gg 


gttccagcgt 


ttgctgtttt 


ctttctgcag 


ctcccattca 


atttccacgc 


240 


t ga.cc tgtcc 






acgacgccag 


cttagcttga 


ctgaaggtag 


300 


t tggaggat c 


cgcgggaaca 


ggt gt attct 


taatcaggat 


ctgaggaggc 


gggtgtttca 


360 


gtccaaagcc 


tcccatcagc 


ggcgagggat 


gaaagtttcc 


gtccgtgtga 


ggaatcttgg 


420 


cccagatagg 


accctgcagg 


tacacgtccc 


ggttctgcca 


gaccatgcca 


ggtaaggctc 


480 


cttgactgtt 


gacggcccct 


acaataggag 


cggcgttttg 


ctgttgcagg 


ttatcggcca 


540 


ccacgccgt a 


ctgttctgtg 


gccactgggt 


tggtggtttt 


aatttcttcc 


tcactggtta 


600 


gcataacgct 


gctatagtcc 


acgttgtctt 


ttccagctcc 


ctgtttccca 


aacattaaga 


660 


ctccgctgga 


cggaaaaaat 


cgctcttcgt 


cgtccttgtg 


ggttgccata 


gcgacaccgg 


720 


gatttaccag 


agagtctctg 


ccattcagat 


gatacttggt 


ggcaccggtc 


caggcaaagt 


780 


tgctgttgtc 


attttgcgac 


agtgtcgtgg 


agacgcgttg 


ctgccggtag 


cagggcccgg 


840 


gtagccagtt 


tttggcctga 


gccgacatgt 


tattaggccc 


ggcctgagaa 


aatagcaact 


900 


gctgagttcc 


tgcggtacct 


cccgtggact 


gagtccgaga 


caggtagtac 


aggtactggt 


960 


cgatgagggg 


gttcatcagc 


cggtccaggc 


tttggctgtg 


cgcgtagctg 


ctgtgaaaag 


1020 


gcacgtcctc 


aaactggtag 


ctgaactcaa 


agttgttgcc 


cgttctcagc 


atttgagaag 


1080 


gaaagtactc 


caggcagtag 


aaggaggaac 


ggcccacggc 


ctgactgcca 


ttgttcagag 


1140 


tcaggtaccc 


gtactgagga 


atcatgaaga 


cgtccgccgg 


gaacggaggc 


aggcagccct 


1200 


ggtgcgcaga 


gccgaggacg 


tacgggagct 


ggtattccga 


gtccgtaaag 


acctgaatcg 


1260 


tgctggtaag 


gttattggcg 


atggtcttgg 


tgccttcatt 


ctgcgtgacc 


tccttgacct 


1320 


ggatgttgaa 


gagcttgaag 


ttgaggctct 


tgggccggaa 


tccccagttg 


ttgttgatga 


1380 


gtcgctgcca 


gtcacgtggt 


gagaagtggc 


^gtggaatct 


gttaaagtca 


aaataccccc 


1440 


agggggtgct 


gtagccgaag 


taggtgttgt 


cgttggtgct 


tcctcccgaa 


gtcccgttgg 


1500 


agatttgctt 


gtagaggtgg 


ttgttgtagg 


tggggagggc 


ccaggttcgg 


gtgctggtgg 


1560 


tgatgactcc 


gtcgcccagc 


catgtggaat 


cgcaatgcca 


atttcctgag 


gaactaccca 


1620 


ctccgtcggc 


gccttcgtta 


ttgtctgcca 


ttggagcgcc 


accgcctgca 


gccattgtac 


1680 
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cagatcccag 


accagagggg 


cctgcggggg 


gttctccgat 


tggttgaggg 


tcgggcactg 


1740 


actctgagtc 


gccagtctgc 


ccaaagttga 


gtctcttttt 


cgcgggctgc 


tggcctttct 


1800 


tgccgatgcc 


cgtagaggag 


tctggagaac 


gctggggtga 


tggctctacc 


ggtctcttct 


1860 


ttccaggagc 


cgtcttagcg 


ccttcctcaa 


ccagaccgag 


aggttcgaga 


acccgcttct 


1920 


tggcctggaa 


gactgctcgc 


ccgaggttgc 


ccccaaaaga 


cgtatcttct 


tgcagacgct 


1980 


cctgaaactc 


ggcgtcggcg 


tggttatacc 


gcaggtacgg 


attgtcaccc 


gctttgagct 


2040 


gctggtcgta 


ggccttgtcg 


tgctcgaggg 


ccgctgcgtc 


cgccgcgttg 


acgggctccc 


2100 


ccttgtcgag 


tccgttgaag 


ggtccgaggt 


acttgtagcc 


aggaagcacc 


agaccccggc 


2160 


cgtcgtcctg 


cttttgctgg 


ttggctttgg 


gcttcggggc 


tccaggtttc 


agcgcccacc 


2220 


actcgcgaat 


gccctcagag 


aggttgtcct 


cgagccaatc 


tggaagataa 


ccatcggcag 


2280 


ccatacctga 


tttaaatcat 


ttattgttca 


aagatgcagt 


catccaaatc 


cacattgacc 


2340 


agatcgcagg 


cagtgcaagc 


gtctggcacc 


tttcccatga 


tatgatgaat 


gtagcacagt 


2400 


ttctgatacg 


cctttttgac 


gacagaaacg 


ggttgagatt 


ctgacacggg 


aaagcactct 


2460 


aaacagtctt 


tctgtccgtg 


agtgaagcag 


atatttgaat 


tctgattcat 


tctctcgcat 


2520 


t gtctgcagg 


gaaacagcat 


cagattcatg 


cccacgtgac 


gagaacattt 


gttttggtac 


2580 


ctgtctgcgt 


agttgatcga 


agcttccgcg 


tctgacgtcg 


atggctgcgc 


aact gactcg 


2640 


cgcacccgtt 


tgggctcact 


tatatctgcg 


tcactggggg 


cgggtctttt 


cttggctcca 


2700 


ccctttttga 


cgtagaattc 


atgctccacc 


tcaaccacgt 


gatcctttgc 


ccaccggaaa 


2760 


aagtctttga 


cttcctgctt 


ggtgaccttc 


ccaaagtcat 


gatccagacg 


gcgggt gagt 


2820 














2880 


tcaatcacgg 


cgcacatgtt 


ggtgttggag 


gtgacgatca 


cgggagtcgg 


gtctatctgg 


2940 


gccgaggact 


tgcatttctg 


gtccacgcgc 


accttgcttc 


ctccgagaat 


ggctttggcc 


3000 


gactccacga 


ccttggcggt 


catcttcccc 


tcctcccacc 


agatcaccat 


cttgtcgaca 


3060 


cagtcgttga 


agggaaagtt 


ctcattggtc 


cagttgacgc 


agccgtagaa 


agggcgaatt 


3120 


c 












3121 



55 



<210> 14 
<211> 3131 
<212> DNA 

<213> new AAV serotype, clone 1-3 
<400> 14 

gcggccgcga attcgccctt ggctgcgtca actggaccaa tgagaacttt cccttcaatg 60 
attgcgtcga caagatggtg atctggtggg aggagggcaa gatgacggcc aaggtcgtgg 120 
agtccgccaa ggccattctc ggcggcagca aggtgcgcgt ggaccaaaag tgcaagtcgt 180 
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ccgcccagat 


cgaccccacc 


cccgtgatcg 


tcacctccaa 


caccaacatg 


tgcgccgtga 


240 


5 


ttgacgggaa 


cagcaccacc 


ttcgagcacc 


agcagcctct 


ccaggaccgg 


atgtttaagt 


300 




tcgaactcac 


ccgccgtctg 


gagcacgact 


ttggcaaggt 


gacaaagcag 


gaagtcaaag 


360 




agttcttccg 


ctgggccagt 


gatcacgtga 


ccgaggtggc 


gcatgagttt 


tacgtcagaa 


420 


10 


agggcggagc 


cagcaaaaga 


cccgcccccg 


atgacgcgga 


taaaagcgag 


cccaagcggg 


480 




cctgcccctc 


agtcgcggat 


ccatcgacgt 


cagacgcgga 


aggagctccg 


gtggactttg 


540 




ccgacaggta 


ccaaaacaaa 


tgttctcgtc 


acgcgggcat 


gcttcagatg 


ctgtttccct 


600 


15 


gcaaaacgtg 


cgagagaatg 


aatcggaatt 


tcaacatttg 


cttcacacac 


ggggtcagag 


660 




actgctcaga 


gtgtttcccc 


ggcgtgtcag 


aatctcaacc 


ggtcgtcaga 


aagaggacgt 


72 0 




atcggaaact 


ccgtgcgatt 


catcatctgc 


tggggcgggc 


tcccgagatt 


gcttgctcgg 


780 


20 


cctgcgatct 


ggtcaacgtg 


gacctggatg 


actgtgtttc 


tgagcaataa 


atgacttaaa 


840 




ccaggtatgg 


ctgccgatgg 


ttatcttcca 


gattggctcg 


aggacaacct 


ctctgagggc 


900 




attcgcgagt 


ggtgggcgct 


gaaacctgga 


gccccgaagc 


ccaaagccaa 


ccagcaaaag 


960 


25 


caggacgacg 


gccggggtct 


ggtgcttcct 


ggctacaagt 


acctcggacc 


cttcaacgga 


1020 




ctcgacaagg 


gggagcccgt 


caacgcggcg 


gacgcagcgg 


ccctcgagca 


cgacaaggct 


1080 




tacgaccagc 


agctgcaggc 


gggtgacaat 


ccgtacctgc 


ggtataacca 


cgccgacgcc 


1140 


30 


gagtttcagg 


agcgtctgca 


agaagatacg 


tcttttgggg 


gcaacctcgg 


gcgagcagtc 


1200 




ttccaggcca 


agaagcgggt 


tctcgaacct 


ctcggtctgg 


ttgaggaagg 


cgctaagacg 


1260 




gctcctggaa 


agaagagacc 


ggtagagcca 


tcaccccagc 


gttctccaga 


ctcctctacg 


1320 


35 


ggcatcggca 


agaaaggcca 


acagcccgcc 


agaaaaagac 


tcaattttgg 


tcagactggc 


1380 




gactcagagt 


cagttccaga 


ccctcaacct 


ctcggagaac 


ctccagcagc 


gccctctggt 


144 0 




gtgggaccta 


atacaatggc 


tgcaggcggt 


ggcgcaccaa 


tggcagacaa 


taacgaaggc 


1500 


40 


gccgacggag 


tgggtagttc 


ctcgggaaat 


tggcattgcg 


attccacatg 


gctgggcgac 






agagtcatca 


ccaccagcac 


ccgaacctgg 


gccctgccca 


cctacaacaa 


ccacctctac 


1620 




aagcaaatct 


ccaacgggac 


atcgggagga 


gccaccaacg 


acaacaccta 


cttcggctac 


1680 


45 


agcaccccct 


gggggtattt 


tgactttaac 


agattccact 


gccacctttc 


accacgtgac 


1740 




tggcagcgac 


tcatcaacaa 


caactgggga 


ttccgaccca 


agagactcag 


cttcaagctc 


1800 




ttcaacatcc 


aggtcaagga 


ggtcacgcag 


aatgaaggca 


ccaagaccat 


cgccaataac 


1850 


50 


ctcaccagca 


ccatccaggt 


gtttacggac 


tcggagtacc 


agctgccgta 


cgttctcggc 


1920 




tctgtccacc 


agggctgcct 


gcc tccgttc 


ccggcggacg 


tgttcatgat 


tccccagtac 


1980 




ggctacctaa 


cactcaacaa 


cggtagtcag 


gccgtgggac 


gctcctcctt 


ctactgcctg 


2040 


55 


gaatactttc 


cttcgcagat 


gctgagaacc 


ggcaacaact 


tccagtttac 


ttacaccttc 


2100 



72 



EP1 310 571 A2 





gaggacgtgc 


ctttccacag 


cagctacgcc 


cacagctaga 


gcttggaccg 


gctgatgaat 


2160 


5 


cctctgattg 


accagtacct 


gtactacttg 


tctcggactc 


aaacaacagg 


aggcacggca 


2220 




aatacgcaga 


ctctgggctt 


cagccaaggt 


gggcctaata 


caatggccaa 


tcaggcaaag 


2280 




aactggctgc 


caggaccctg 


ttaccgccaa 


caacgcgtct 


caacgacaac 


cgggcaaaac 


2340 


10 


aacaatagca 


actttgcctg 


gactgctggg 


accaaatacc 


atctgaatgg 


aagaaattca 


2400 




ttggctaatc 


ctggcatcgc 


tatggcaaca 


cacaaagacg 


acgaggagcg 


tttttttccc 


2460 




agtaacggga 


tcctgatttt 


tggcaaacaa 


aatgctgcca 


gagacaatgc 


ggattacagc 


2520 


15 


gatgtcatgc 


tcaccagcga 


ggaagaaatc 


aaaaccacta 


accctgtggc 


tacagaggaa 


2580 




tacggtatcg 


tggcagataa 


cttgcagcag 


caaaacacgg 


ctcctcaaat 


tggaactgtc 


2640 




aacagccagg 


gggccttacc 


cggtatggtc 


tggcagaacc 


gggacgtgta 


cctgcagggt 


2700 


20 


cccatctggg 


ccaagattcc 


tcacacggac 


ggcaacttcc 


acccgtctcc 


gctgatgggc 


2760 




ggctttggcc 


tgaaacatcc 


tccgcctcag 


atcctgatca 


agaacacgcc 


tgtacctgcg 


2820 




gatcctccga 


ccaccttcaa 


ccagtcaaag 


ctgaactctt 


tcatcacgca 


atacagcacc 


2880 


25 


ggacaggtca 


gcgtggaaat 


tgaatgggag 


ctgcagaagg 


aaaacagcaa 


gcgctggaac 


2940 




cccgagatcc 


agtacacctc 


caactactac 


aaatctataa 


gtgtggactt 


tgctgttaat 


3000 




acagaaggcg 


tgtactctga 


accccgcccc 


attggcaccc 


gttacctcac 


ccgtaatctg 


3060 


30 


taattgcctg 


ttaatcaata 


aaccggttga 


ttcgtttcag 


ttgaactttg 


gtctctgcga 


3120 




agggcgaatt 


c 










3131 



<2a0> 15 

<211> 3127 

<212> DNA 

<213> new AAV serotype, clone 13-3b 

<400> 15 



40 



45 



50 



55 



gcggccgcga 


attcgccctt 


cgcagagacc 


aaagttcaac 


tgaaacgaat 


caaccggttt 


60 


attgattaac 


atgcaattac 


agattacggg 


tgaggtaacg 


agtgccaata 


gggcgaggct 


120 


cagagtaaac 


accctggctg 


tcaacggcaa 


agtccacacc 


agtctgcttt 


tcaaagttgg 


180 


aggtgtactg 


aatctccggg 


tcccagcgct 


tgctgttttc 


cttctgcagc 


tcccactcga 


240 


tttccacgct 


gacttgtccg 


gtgctgtact 


gtgtgatgaa 


cgaagcaaac 


ttggcaggag 


300 


taaacacctc 


cggaggatta 


gcgggaacgg 


gagtgttctt 


gatcaggatc 


tgaggaggcg 


360 


gatgtttaag 


tccaaagccg 


cccatcaaag 


gagacgggtg 


aaagttgcca 


tccgtgtgag 


420 


gaatcttggc 


ccagatggga 


ccctgcaggt 


acacgtcccg 


gttctgccag 


accatgccag 


480 


gtaaggctcc 


ctggttgttg 


acaacttgtg 


tctgggctgc 


agtattagcc 


gcttgtaagt 


540 


tgctgctgac 


tatcccgtat 


tcttccgtgg 


ctacaggatt 


agtaggacga 


atttcttctt 


600 


catttgtcat 


taacacattt 


tccaatgtag 


ttttgttagt 


tgctccagtt 


tttccaaaaa 


660 
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tcaggactcc 


gctggatggg 


aaaaagcggt 


cctcgtcgtc 


cttgtgagtt 


gccatggcga 


720 


5 


cgccgggatt 


aaccaacgag 


tttctgccgt 


tcaggtgata 


tttggtggca 


ccagtccaag 


780 




caaagttgct 


gttgttgttt 


tgatccagcg 


ttttggagac 


cctttgttgc 


cggaagcagg 


840 




gtccaggtaa 


ccaattcttg 


gcttgttcgg 


ccatagttga 


aggcccgccc 


tggtaaaact 


900 


10 


gcagttcccg 


attgccagct 


gtgcctcctg 


ggtcactctg 


tgttctggcc 


aggtagtaca 


960 




agtactggtc 


gatgagggga 


ttcatcagcc 


ggtccaggct 


ctggctgtgt 


gcgtagctgc 


1020 




tgtggaaagg 


cacgtcctcg 


aagctgtagc 


tgaactcaaa 


gttgttgccc 


gttctcagca 


1080 


15 


tctgagaggg 


gaagtactcc 


aggcagtaga 


aggaggaacg 


tcccacagac 


tgactgccat 


1140 




tgttgagagt 


caggtagccg 


tactgaggaa 


tcatgaagac 


gtccgccggg 


aacggaggca 


1200 




ggcagccctg 


gtgcgcagag 


ccgaggacgt 


acggcagctg 


gtattccgag 


tccgagaata 


1260 


20 


cctgaatcgt 


gctggtaagg 


ttattagcga 


tggtcgtaac 


gccgtcattc 


gtcgtgacct 


1320 




ccttgacctg 


gatgtt gaag 


agcttgaacc 


gcagcttctt 


gggccggaat 


ccccagttgt 


1380 




tgttgatgag 


tcgctgccag 


tcacgtggtg 


agaagtggca 


gtggaatctg 


ttaaagtcaa 


1440 


25 


aataccccca 


gggggtgctg 


t agccgaagt 


aggtgttgtc 


gttggtacta 


cctgcagttt 


1500 




cactggagat 


ttgctcgtag 


aggtggttgt 


tgtaggtggg 


cagggcccag 


gttcgggtgc 


1550 




tggtggtaat 


gactctgtcg 


cccagccatg 


tggaatcgca 


atgccaattt 


cctgaggcat 


1620 


30 


tacccactcc 


gtcggcacct 


tcgttattgt 


ctgccattgg 


t gcgccaccg 


cctgcagcca 


1680 




ctgtaccaga 


tcccacac ta 


gagggcgctg 


ctggaggttc 


tccgagaggt 


tgagggtcgg 


1740 




ggactgactc 


tgagtcgcca 


gtctgaccga 


aattgagtct 


ctttctggcg 


ggct gc tggc 


1800 


35 


ccttcttgcc 


gatgcccgtg 


gaggagtcgg 


gggaacgctg 


aggtgacggc 


tctaccggtc 


1860 




tcttctttgc 


sggagccgtc 


ttagcgcctt 


cctcaaccag 


accgagaggt 


tcgagaaccc 


1920 




gcttcttggc 


ctggaagact 


gctcgcccga 


ggttgccccc 


aaatgacgta 


tcttcttgca 


1980 


40 


gacgctcctg 


aaactcggcg 


tcggcgtggt 


tataccgcag 


gtacgggttg 


tcacccgcat 


2040 




tgagctgctg 


gtcgtaggcc 


ttgtcgtgct 


cgagggccgc 


tgcgtccgcc 


gcgttgacgg 


2100 




gctccccctt 


gtcgagtccg 


ttgaagggtc 


cgaggtactt 


gtagccagga 


agcaccagac 


2160 


45 


cccggccgtt 


gtcctgcttt 


tgctggttgg 


ctttgggttt 


cggggctcca 


ggtttcaggt 


2220 




cccaccactc 


gcgaatgccc 


tcagagaggt 


tgtcctcgag 


ccaatctgga 


agataaccat 


2280 




cggcagccat 


acctgattta 


aatcatttat 


tgttcaaaga 


tgcagtcatc 


caaatccaca 


2340 


50 


ttgaccagat 


cgcaggcagt 


gcaagcgtct 


ggcacctttc 


ccatgatatg 


atgaatgtag 


2400 




cacagtttct 


gatacgcctt 


tttgacgaca 


gaaacgggtt 


tagattctga 


cacgggaaag 


2460 




cactctaaac 


agtctttctg 


tccgtgagtg 


aagcagatat 


ttgaattctg 


attcattctc 


2520 


55 


tcgcattgtc 


tgcagggaaa 


cagcatcaga 


ttcatgccca 


cgtgacgaga 


acatttgttt 


2580 
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tggtacctgt 


ctgcgtagtt 


gatcgaagct 


tccgcgtctg 


acgtcgatgg 


ctgcgcaact 


2640 


5 


gactcgcgca 


cccgtttggg 


ctcacttata 


tctgcgtcac 


tgggggcggg 


tcttttcttg 


2700 




gctccaccct 


ttttgacgta 


gaattcatgc 


tccacctcaa 


ccacgtaatc 


ctttgcccac 


2760 


10 


cggaaaaagt 


ctttgacttc 


ctgcttggtg 


accttcccaa 


agtcatgatc 


cagacggcgg 


2820 


gtgagttcaa 


atttgaacat 


ccggtcttgc 


aacggctgct 


ggtgttcgaa 


ggtcgttgag 


2880 




ttcccgtcga 


tcacggcgca 


catgttggtg 


ttggagatga 


cgatcgcggg 


agtcgggtct 


2940 


15 


atctgggccg 


aggacttgca 


tttctggtcc 


acgcgcacct 


tgcttcctcc 


gagaatggct 


3000 


ttggccgact 


ccacgacctt 


ggcggtcatc 


ttcccctcct 


cccaccagat 


caccatcttg 


3060 




tcgacacagt 


cgttgaaggg 


aaagttctca 


ttggtccagt 


tgacgcagcc 


gtagaaaggg 


3120 


20 


cgaattc 












3127 


25 


<210> 16 
<211> 3106 
<212> DNA 

<213> new AAV serotype, clone 24-1 








<400> 16 
gcggccgcga 


attcgccctt 


cgcagagacc 


aaagttcaac 


tgaaacgaat 


caaccggttt 


60 




attgattaac 


aagtaattac 


aggttacggg 


tgaggtaacg 


ggtgccaatg 


gggcgaggct 


120 


30 


cagtataaac 


cccttcgttg 


ttgacagcaa 


attccacatt 


attagacttg 


gcataatttg 


180 




aggtgtactg 


aatctctgga 


ttccagcgtt 


tgctgttttc 


tttctgcagt 


tcccactcga 


240 




tctccacgct 


gacctggccg 


gtgctgtact 


gcgtgataaa 


tgaggcaaac 


ttggcaggag 


300 


35 


taaacacctc 


tggaggatta 


gcaggtaccg 


gggtgttttt 


gatgagaatt 


tgaggaggcg 


360 




ggtgtttgag 


tccaaatccg 


cccatcaggg 


gagacgggtg 


aaagttgccg 


tccgtgtgag 


420 




gaattttggc 


ccagatggga 


ccctgcaggc 


acacgtcccg 


gttctgccag 


accatgccgg 


480 


40 


gcagagcccc 


ctggctgttg 


acagtctgtg 


tctggggtcc 


ggccgtagac 


gattgcaggt 


540 




tgctggagac 


cacaccgtat 


tcttctgtag 


ccacgggatt 


ggtggttttg 


atctcctcct 


600 




cgctggtcat 


tagcacgttt 


tccagcgttg 


tcttgttggc 


agcccccgtt: 


ttgccaaaaa 


660 


45 


ccagcactcc 


gttgatggga 


aagaactggt 


cctcgtcgtc 


cttgttggtg 


gccatggcta 


720 




cgcccgggtt 


ggttaatgaa 


tttctaccat 


tcagatggta 


t ttagtggcc 


ccggtccagg 


7 8 0 




caaagttact 


gttgttgttg 


ctgtctatgt 


tttttgacag 


tctctgctgc 


cgataacagg 


840 


50 


gtccgggcag 


ccagttcttt 


gattgctcgg 


ccatggtgtt 


gggcccagcc 


tgatggaact 


900 




gcagctccct 


tgtggacccc 


gtagtgctct 


gggtccgggc 


caggtagtac 


aggtactggt 


960 




cgatgagggg 


attcatcagc 


cggtctaggc 


tctggctgtg 


cacatagctg 


ctgtggaaag 


1020 


55 


gcacttcctc 


aaaggtgtag 


ctgaattcaa 


agttattgcc 


cgttctcagc 


atctgagaag 


1080 
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gaaagtactc 


caggcagtag 


aaggaggaac 


gtcccacaga 


ctgactgccg 


ttgtttagag 


1140 


5 


tcagatatcc 


gtactgagga 


atcatgaaca 


cgtccgcagg 


gaacggaggg 


aggcagccct 


1200 




ggtgcgcaga 


gccgaggacg 


tacggcagtt 


ggtactccga 


gtccgagaag 


acctgaatcg 


12 60 




t gctggtaag 


gttattagcg 


atggtcgtaa 


cgccgtcgtt 


cgtcgtgacc 


tccttgacct 


132 0 


10 


ggatgttgaa 


caacttgaac 


cgcagctttc 


tgggccggaa 


tccccagttg 


ttgttgatga 


1380 




gtcgctgcca 


gtcacgtggt 


gagaagtggc 


agtggaatct 


gttgaagtca 


aaatagcccc 


1440 




agggggtgct 


gtagctgaag 


aagtggttgt 


cgttggtagc 


cccgctctga 


cttgatatct 


1500 


15 


gcttgtagag 


gtggttgttg 


taggtgggca 


gggcccaggt 


gcgggtgctg 


gtggtgatga 


1560 




ctctgtcgcc 


cagccatgtg 


gaatcgcaat 


gccaatttcc 


ggaggcatta 


cccactccgt 


1620 




cggcgccttc 


gttattgtct 


gccattggtg 


cgccaccgcc 


tgcagccatt 


gtaccagatc 


1680 


20 


ccagacctga 


gggcgcggcg 


ggaggttctc 


cgagaggttg 


ggggtcgggc 


actgactctg 


1740 




agtcgccagt 


ctgcccaaag 


ttgagcttct 


ttttagcggg 


ctgctggcct 


ttcttgccga 


1800 




tgcccgtgga 


ggagtcgggg 


gattctatgg 


gtctcttctt 


tccaggagcc 


gtcttagcga 


18 60 


25 


cttcctcaac 


cagaccgaga 


ggttcgagaa 


cccgcttctt 


ggcctggaag 


actgctcgcc 


192 0 




cgaggttgcc 


cccaaaagac 


gtatcttctt 


gaagacgctc 


ctgaaactcg 


gcgtcggcgt 


1980 




ggttgtactt 


gaggtacggg 


ttgtccccct 


gctcgagctg 


cttgtcgtag 


gccttgtcgt 


204 0 


30 


gctcgagggc 


cgcggcgtct 


gcctcgttga 


ccggctctcc 


cttgtcgagt 


ccgttgaagg 


2100 




gtctgaggta 


cttgtagcca 


ggaagcacca 


gaccccggcc 


gtcgtcctgc 


ttttgctggt 


2160 




tggctttggg 


tttcggggct 


ccaggtttca 


agtcccacca 


ctcgcgaatg 


ccctcagaga 


222 0 


35 


ggttgtcctc 


gagccaatct 


ggaagataac 


catcggcagc 


catacctggt 


ttaagtcatt 


228 0 




tattgctcag 


aaacacagtc 


atccaggtcc 


acgttgacca 


gatcgcaggc 


cgagcaagca 






atctcgggag 


cccgccccag 


cagatgatga 


atggcacaga 


gtttccgata 


cgtcctcttt: 




40 


ctgacgaccg 


gttgagattc 


tgacacgccg 


gggaaacatt 


ctgaacagtc 


tctggtcccg 






tgcgtgaagc 


aaatgttgaa 


attctgattc 


actctctcgc 


atgtcttgca 


gggaaacagc 


252 0 




atctgaagca 


tgcccgcgtg 


acgagaacat 


ttgttttggt 


acctgtcggc 


aaagtccacc 


2 58 0 


45 


ggagctcctt 


ccgcgtctga 


cgtcgatgga 


ttcgcgactg 


aggggcaggc 


ccgcttgggc 


2 64 0 




tcgcttttat 


ccgcgtcatc 


gggggcgggt 


ctcttgttgg 


ccccaccctt; 


tctgacgtag 


2700 




aacccatgcg 


ccacctcggt 


cacgtgatcc 


tgcgcccagc 


ggaagaacct 


tttgacttcc 


2760 


50 


tgctttgtca 


ccttgccaaa 


gttatgctcc 


agacggcggg 


tgggttcaaa 


tttgaacatc 


2820 




cggtcctgca 


acggctgctg 


gtgct cgaag 


gtggcgctgt 


tcccg tcaat 


cacggcgcac 


2880 




atgttggtgt 


tggaggtgac 


ggtcacgggg 


gtggggtcga 


tctgggcgga 


cgac ttgcac 


2940 


55 


ttttggtcca 


cgcgcacctt 


gctgccgccg 


agaatggcct 


tggcggactc 


cacgaccttg 


3000 



76 



EP1 310 571 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



gccgtcatct 


tgccctcctc 


ccaccagatc 


accatcttgt 


cggcgcaatc 


gttgaaggga 


3060 


aagttctcat 


tggtccagtt 


gacgcagccg 


tagaaagggc 


gaattc 




3106 


<210> 17 
<211> 3102 
<212> DNA 

<213> new AAV serotype, clone 27-3 








<400> 17 
gcggccgcga 


attcgccc tt 


cgcagagacc 


aaag tt caac 


t gaaacgaat 


caaccggttt 


60 


attgattaac 


aagt aattac 


aggttacggg 


tgaggtaacg 


ggtgccaatg 


gggcgaggct 


120 


c agtataaac 


cccttcgttg 


ttgacagcaa 


attccacatt 


attagacttg 


gcataatttg 


180 


aggtgtactg 


aatctctgga 


ttccagcgtt 


t get gt t - tc 


tttctgcagt 


tcccactcga 


240 


tctccacgct 


gacctggccg 


gtgctgt act 


gcgtgataaa 


tgaggcaaac 


ttggcaggag 


300 


taaacacctc 


tggaggatta 


gcaggtacc g 


gggtgttt tt 


gatgagaatt 


tgaggaggcg 


360 


ggt gttt gag 


tccaaatccg 


cccatcaggg 


gagacgggt g 


aaagttgccg 


tccgtgtgag 


420 


gaat tt c ggc 


ccagatggga 


ccctgcaggt 


acacgtcccg 


gttctgccag 


accatgccgg 


480 




I— LgyouyuLy 




^ /~* *^ ^ /*<• 

ucugyygucc 


ggccgtagac 


gattgcaggt 


540 






^ "t" 4" "f* 




ggtggttttg 


atctcctcct 


600 




Udyod^y 


L^v^dyL^y u y 


L.i,y u uyyc^ 


agcccccgtt 


ttgccaaaaa 


660 






o.yyaac^uygi. 


^ "t* <^ /T 4" ^ 

ucyuc^y cc 


cttgttggtg 


gccatggcta 


720 




yy LUetdLycict 




uc.a.gaLgg 


tttagtggcc 


ccggtccagg 


780 




y I- Ly Luy i_ug 




\,\, L.U i.y a.Cag 


tctctgctgc 


cgataacagg 


840 


^ ^ A 

y i_ccyyyca,y 


^ ^ ^ ^ ^ 
^^CiyUL.v^L.UT^ 


gattgctcgg 


c c ac g g t g 1 1 


gggcccagcc 


tgatggaact 


900 


y w cL y ^ i>» ^ \tf 


y Lry y ci v« 


y loCLy L.yLrf u 


yy yT^ccyygc 


caggtagtac 


aggtactggt 


960 






wyy i.^LrfCiyy^ 


^ "t* ^ ^ ^ ^ ^ ^ ^ 

uyyc^uy t-y 


cgcatagctg 


ctgtggaaag 


1020 


gcacttcctc 


aaaggtgtag 


ctgaattcaa 


agttattgcc 


cgttctcagc 


atctgagaag 


1080 


gaaagtactc 


caggcagcag 


aaggaggaac 


gtcccacaga 


ctgactgccg 


ttgtttagag 


1140 


tcagatatcc 


gtactgagga 


atcatgaaca 


cgtccgcagg 


gaacggaggg 


aggcagccct 


1200 


ggtgcgcaga 


gccgaggacg 


tacggcagtt 


ggtactccga 


gtccgagaag 


acctgaatcg 


1260 


tgctggtaag 


gttattagcg 


atggtcgtaa 


cgccgtcgtt 


cgtcgtgacc 


tccttgacct 


1320 


ggatgttgaa 


caacttgaac 


cgcagctttc 


tgggccggaa 


tccccagttg 


ttgttgatga 


1380 


gtcgctgcca 


gtcacgtggt 


gagaagtggc 


agtggaatct 


gttgaagtca 


aaatagcccc 


1440 


agggggtgct 


gtagccgaag 


aagtggttgt 


cgttggtagc 


cccgctctga 


cttgatatct 


1500 


gcttgtagag 


gtggttgttg 


taggtgggca 


gggcccaggt 


gcgggtgctg 


gtggtgatga 


1560 


ctctgtcgcc 


cagccatgtg 


gaatcgcaat 


gccaatttcc 


ggaggcatta 


cccactccgt 


1620 
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cggcgccttc 


gttattgtct 


gccattggtg 


cgccaccgcc 


tgcagccatt 


gtaccagatc 


1680 


ccagacctga 


gggcgcggcg 


ggaggttctc 


cgagaggttg 


ggggtcgggc 


actgactctg 


1740 


agtcgccagt 


ctgcccaaag 


ttgagcttct 


ttttagcggg 


ctgctggcct 


ttcttgccga 


1800 


tgcccgtgga 


ggagtcgggg 


gattctatgg 


gtctcttctt 


tccggaagcc 


gtcttagcgc 


1860 


cttcctcaac 


cagaccgaga 


ggttcgagaa 


cccgcttctt 


ggcctggaag 


actgctcgcc 


1920 


cgaggttgcc 


cccaaaagac 


gtatcttctt 


gaagacgctc 


ctgaaactcg 


gcgtcggcgt 


1980 


ggttgtactt 


gaggtacggg 


ttgtccccct 


gctcgagctg 


cttgtcgtag 


gccttgtcgt 


2040 


gctcgagggc 


cgcggcgtct 


gcctcgttga 


ccggctctcc 


cttgtcgagt 


ccgttgaagg 


2100 


gtccgaggta 


cttgtagcca 


ggaagcacca 


gaccccggcc 


gtcgtcctgc 


ttttgctggt 


2160 


tggctttggg 


tttcggggct 


ccaggtttca 


agtcccacca 


ctcgcgaatg 


ccctcagaga 


2220 


ggttgtcctc 


gagccaatct 


ggaagataac 


catcggcagc 


catacctggt 


ttaagtcatt 


2280 


tattgctcag 


aaacacagtc 


atccaggtcc 


acgttgacca 


gatcgcaggc 


cgagcaagca 


2340 


atctcgggag 


cccgccccag 


cagatgatga 


atggcacaga 


gtttccgata 


cgtcctcttt 


2400 


ctgacgaccg 


gttgagattc 


tgacacgccg 


gggaaacatt 


ctgaacagtc 


tctggtcccg 


2460 


tgcgtgaagc 


aaatgttgaa 


attctgattc 


attctctcgc 


atgtcttgca 


gggaaacagc 


2 52 0 


atctgaagca 


tgcccgcgtg 


acgagaacat 


ttgttttggt 


acctgtcggc 


aaagtccacc 


2580 


ggagctcctt 


ccgcgtctga 


cgtcgatgga 


tccgcgactg 


aggggcaagc 


ccgcttgggc 


2640 


tcgcttttat 


ccgcgtcatc 


gggggcgggt 


ctcttgttgg 


ctccaccctt 


tctgacgtag 


2700 


aactcatgcg 


ccacctcggt 


cacgtgatcc 


tgcgcccagc 


ggaagaactc 


tttgacttcc 


2760 


tgctttgtca 


ccttgccaaa 


gtcatgctcc 


acracacrcaacr 


tgagt tcaaa 


tttgaacatc 


2820 


cggtcttgta 


acggctgctg 


gtgctcgaag 


gtggtgctgt 


tcccgtcaat 


cacggcgcac 


2880 


atgttggtgt 


tggaagtgac 


gatcacgggg 


gtgggatcga 


tctgggcgga 


cgacttgcac 


2940 


ttttggtcca 


cgcgcacctt 


gctgccgccg 


agaatggcct 


tggcggactc 


cacgaccttg 


3000 


gccgtcatct 


tgccctcctc 


ccaccagatc 


accatcttgt 


cgacgcaatc 


gttgaaggga 


3060 


aagttctcat 


tggtccagtt 


gacgcagccg 


aagggcgaat 


tc 




3102 



<210> 18 
<211> 3106 
<212> DNA 

<213> new AAV serotype, clone 7-2 

<400> 18 

gcggccgcga attcgccctt cgcagagacc aaagttcaac tgaaacgaat cagccggttt 60 
attgattaac aagtaattac aggttacggg tgaggtaacg ggtgccaatg gggcgaggct 120 
cagtataaac cccttcgttg ttgacagcaa attccacatt attagacttg gcataatttg 180 
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aggtgtactg 


aatctctgga 


ttccagcgtt 


tgctgttttc 


tttctgcagt 


tcccactcga 


240 


5 


tctccacgct 


gacctggccg 


gtgctgtact 


gcgtgataaa 


tgaggcaaac 


ttggcaggag 


300 




taaacacctc 


tggaggatta 


gcaggtaccg 


gggtgttttt 


gatgagaatt 


tgaggaggcg 


360 




ggtgtttgag 


tccaaatccg 


cccatcaggg 


gagacgggtg 


aaagttgccg 


tccgtgtgag 


420 


10 


gaattttggc 


ccagatggga 


ccctgcaggt 


acacgtcccg 


gttctgccag 


accatgccgg 


480 




gcagagcccc 


ctggctgttg 


acagtctgtg 


tctggggtcc 


ggccgtagac 


gattgcaggt 


540 




tgctggagac 


cacaccgtat 


tcttctgtag 


ccacgggatt 


ggtggttttg 


atctcctcct 


600 


15 


cgctggtcat 


tagcacgttt 


tccagcgttg 


tcttgttggc 


agcccccgtt 


ttgccaaaaa 


660 




ccagcactcc 


gttgatggga 


aagaactggt 


cctcgtcgtc 


cttgttggtg 


gccatggcta 


720 




cgcccgggtt 


ggttaatgaa 


tttctaccat 


tcagatggta 


tttagtggcc 


ccggtccagg 


780 


20 


caaagttact 


gttgttgttg 


ctgtctatgt 


tttttgacag 


tctctgctgc 


cgataacagg 


840 




gtccgggcag 


ccagttcttt 


gattgctcgg 


ccatggtgtt 


gggcccagcc 


tgatggaact 


900 




gcagctccct 


tgtggacccc 


gtagtgctct 


gggtccgggc 


caggtagtac 


aggtactggt 


9 60 


25 


cgatgagggg 


attcatcagc 


cggtccaggc 


tctggctgtg 


cgcatagctg 


ctgtggaaag 


1020 




gcacttcctc 


aaaggtgtag 


ctgaattcaa 


agttatcgcc 


cgttctcagc 


atctgagaag 


1080 




gaaagtactc 


caggcagtag 


aaggaggaac 


gtcccacaga 


ctgactgccg 


ttgtttagag 


1140 


30 


tcagatatcc 


gtactgagga 


atcatgaaca 


cgtccgcagg 


gaacggaggg 


aggcagccct 


1200 




ggtgcgcaga 


gccgaggacg 


tacggcagtt 


ggtactccga 


gtccgagaag 


acctgaatcg 


1260 




tgctggtaag 


gttattagcg 


atggtcgtaa 


cgccgtcgtt 


cgtcgtgacc 


tccttgacct 


1320 


35 


ggatgttgaa 


caacttgaac 


cgcagctttc 


tgggccggaa 


tccccagttg 


ttgttgatga 


1380 




gtcgctgcca 


gtcacgtggt 


gagaagtggc 


agtggaatct 


gttgaagtca 


aaatagcccc 


1440 




agggggtgct 


gtagccgaag 


aagtggttgt 


cgttggtagc 


cccgctctga 


cttgatatct 


1500 


40 


gcttgtagag 


gtggttgttg 


taggtgggca 


gggcccaggt 


gcgggtgctg 


gtggtgatga 


1560 




ctctgtcgcc 


cagccatgtg 


gaatcgcaat 


gccaatttcc 


ggaggcatta 


cccactccgt 


1620 




cggcgccttc 


gttattgtct 


gccattggtg 


cgccaccgcc 


tgcagccatt 


gtaccagatc 


1680 


45 


ccagacctga 


gggcgcggcg 


ggaggttctc 


cgagaggttg 


ggggtcgggc 


actgactctg 


1740 




agtcgccagt 


ctgcccaaag 


ttgagcttct 


ttttagcggg 


cggctggccg 


ttcttgccga 


1800 




tgcccgtgga 


ggagtcgggg 


gattctatgg 


gtctcttctt 


tccaggagcc 


gtcttagcgc 


1860 


50 


cttcctcaac 


cagaccgaga 


ggttcgagaa 


cccgcttctt 


ggcctggaag 


actgctcgcc 


1920 




cgaggttgcc 


cccaaaagac 


gtatcttctt 


gaagacgctc 


ctgaaactcg 


gcgtcggcgt 


1980 




ggttgtactt 


gaggtacggg 


ttgtccccct 


gctcgagctg 


cttgtcgtag 


gccttgtcgt 


2040 


55 


gctcgagggc 


cgcggcgtct 


gcctcgttga 


ccggctctcc 


cttgtcgagt 


ccgttgaagg 


2100 
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gtccgaggta 


cctgtagcca 


ggaagcacca 


gaccccggcc 


gtcgtcctgc 


ttttgctggt 


2160 


5 


tggctttggg 


tttcggggct 


ccaggtttca 


agtcccacca 


ctcgcgaatg 


ccctcagaga 


2220 




ggttgccctc 


gagccaatct 


ggaagataac 


catcggcagc 


catacctggt 


ttaagtcatt 


2280 




tattgctcag 


aaacacagtc 


atccaggtcc 


acgttggcca 


gatcgcaggc 


cgagcaagca 


2340 


10 


atctcgggag 


cccgccccag 


cagatgatga 


atggcacaga 


gtttccgata 


cgtcctcttt 


2400 




ctgacgaccg 


gttgagattc 


tgacacgccg 


gggaaacatt 


ctgaacagtc 


tctggtcccg 


2460 




tgcgtgaagc 


aaatgttgaa 


attctgattc 


attctctcgc 


atgtcttgca 


ggggaacagc 


2520 


15 


atctgaagca 


tgcccgcgtg 


acgagaacat 


ttgttttggt 


acctgtcggc 


aaagtccacc 


2580 




ggagctcctt 


ccgcgtctga 


cgtcgatgga 


tccgcgactg 


aggggcaggc 


ccgcttgggc 


2640 




tcgcttttat 


ccgcgtcatc 


gggggcgggt 


ctcttgttgg 


ctccaccctt 


tctgacgtag 


2700 


20 


aactcatacg 


ccacctcggt 


cacgtgatcc 


tgcgcccagc 


ggaagaactc 


tttgacttcc 


2760 




tgctttgtca 


ccttgccaaa 


gtcatgctcc 


agacggcggg 


tgagttcaaa 


tttgaacatc 


2820 




cggtcttgta 


acggctgctg 


gtgctcgaag 


gtggtgctgt 


tcccgtcaat 


cacggcgcac 


2880 


25 


atgttggtgt 


tggaagtgac 


gatcacgggg 


gtgggatcga 


tctgggcgga 


cgacttgcac 


2940 




ttttggtcca 


cgcgcacctt 


gctgccgccg 


agaatggcct 


tggcggactc 


cacgaccttg 


3000 




gccgtcatcc 


tgccctcctc 


ccaccagatc 


accatcttgt 


cgacgcaatc 


gttgaaggga 


3060 


30 


aagttctcat 


tggtccagtt 


gacgcagccg 


tagaaagggc 


gaattc 




3106 



<210> 19 

<211> 3105 

<212> DMA 

35 <213> new AAV serotype, clone Cl 

<400> 19 



40 



45 



50 



gaattcgccc 


ttgctgcgtc 


aactggacca 


atgagaactt 


tcccttcaac 


gattgcgtcg 


60 


acaagatggt 


gatctggtgg 


gaggagggca 


agatgaccgc 


caaggtcgtg 


gagtccgcca 


12 0 


aggccattct 


gggcggaagc 


aaggtgcgcg 


tggaccaaaa 


gtgcaagtca 


tcggcccaga 


180 


tcgaccccac 


gcccgtgatc 


gtcacctcca 


acaccaacat 


gtgcgccgtg 


atcgacggga 


240 


acagcaccac 


cttcgagcac 


cagcagccgc 


tgcaggaccg 


catgttcaag 


ttcgagctca 


300 


cccgccgtct 


ggagcacgac 


tttggcaagg 


tgaccaagca 


ggaagtcaaa 


gagttcttcc 


360 


gctgggctca 


ggatcacgtg 


actgaggtgg 


cgcatgagtt 


ctacgtcaga 


aagggcggag 


420 


ccaccaaaag 


acccgccccc 


agtgacgcgg 


atataagcga 


gcccaagcgg 


gcctgcccct 


480 


cagttgcgga 


gccatcgacg 


tcagacgcgg 


aagcaccggt 


ggactttgcg 


gacaggtacc 


54 0 


aaaacaaatg 


ttctcgtcac 


gcgggcatgc 


ttcagatgct 


gtttccctgc 


aagacatgcg 


60 0 


agagaatgaa 


tcagaatttc 


aacgtctgct 


tcacgcacgg 


ggtcagagac 


tgctcagagt 


660 


gcttccccgg 


cgcgtcagaa 


tctcaacccg 


tcgtcagaaa 


aaagacgtat 


cagaaactgt 


720 
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gcgcgattca 


tcatctgctg 


gggcgggcac 


ccgagattgc 


gtgttcggcc 


cgcgatctcg 


780 


tcaacgtgga 


cttggatgac 


tgtgtttctg 


agcaataaat 


gacttaaacc 


aggtatggct 


840 


gctgacggtt 


atcttccaga 


ttggctcgag 


gacaacctct 


ctgagggcat 


tcgcgagtgg 


900 


tgggacctga 


aacctggagc 


ccccaagccc 


aaggccaacc 


agcagaagca 


ggacgacggc 


960 


cggggtctgg 


tgcttcctgg 


ctacaagtac 


ctcggaccct 


tcaacggact 


cgacaagggg 


1020 


gagcccgtca 


acgcggcgga 


cgcagcggcc 


ctcgagcacg 


acaaggccta 


cgaccagcag 


1080 


ctcaaagcgg 


gtgacaatcc 


gtacctgcgg 


tataaccacg 


ccgacgccga 


gtttcaggag 


1140 


cgtctgcaag 


aagatacgtc 


ttttgggggc 


aacctcgggc 


gagcagt ctt 


ccaggccaag 


1200 


aagagggtac 


tcgaacctct 


gggcctggtt 


gaagaaggtg 


ctaagacggc 


tcctggaaag 


1260 


aagagaccgt 


tagagtcacc 


acaagagccc 


gactcctcct 


caggaatcgg 


caaaaaaggc 


1320 


aaacaaccag 


ccaaaaagag 


actcaacttt 


gaagaggaca 


ctggagccgg 


agacggaccc 


1380 


cctgaaggat 


cagataccag 


cgccatgtct 


tcagacattg 


aaatgcgtgc 


agcaccgggc 


1440 


ggaaatgctg 


tcgatgcggg 


acaaggttcc 


gatggagtgg 


gtaatgcctc 


gggtgattgg 


1500 


cattgcgatt 


ccacctggtc 


tgagggcaag 


gtcacaacaa 


cctcgaccag 


aacctgggtc 


1560 


ttgcccacct 


acaacaacca 


cttgtacctg 


cggctcggaa 


caacatcaaa 


cagcaacacc 


1620 


tacaacggat 


tctccacccc 


ctggggatac 


tttgacttta 


acagattcca 


ctgtcacttc 


1680 


tcaccacgtg 


actggcaaag 


actcatcaac 


aacaactggg 


gactacgacc 


aaaagccatg 


1740 


cgcgttaaaa 


tcttcaatat 


ccaagttaag 


gaggtcacaa 


cgtcgaacgg 


cgagactacg 


1800 


gtcgctaata 


accttaccag 


cacggttcag 


atatttgcgg 


actcgtcgta 


tgagctcccg 


1860 


tacgtgatgg 


acgctggaca 


agagggaagt 


ctgtctcctt 


tccccaatga 


cgtcttcatg 


1920 


gtgcctcaat 


atggctactg 


tggcattgtg 


actggcgaaa 


atcagaacca 


gacggacaga 


1980 


aatgctttct 


actgcctgga 


gtattttcct 


tcacaaatgc 


tgagaactgg 


caataacttt 


2040 


gaaatggctt 


acaactttgg 


gaaggtgccg 


ttccactcaa 


tgtatgctta 


cagccagagc 


2100 


ccggacagac 


tgatgaatcc 


cctcctggac 


cagtacctgt 


ggcacttaca 


gtcgaccacc 


2160 


tctggagaga 


ctctgaatca 


aggcaatgca 


gcaaccacat 


ttggaaaaat 


caggagtgga 


2220 


gactttgcct 


tttacagaaa 


gaactggctg 


cctgggcctt 


gtgttaaaca 


gcagagactc 


2280 


t caaaaact g 


ccag'tcaaaa 


ti ac a a ga'k 


cctgccagcg 


ggggcaacgc 


uc ugux^aaag 


9 "5 A n 

^ O *i U 


tatgacaccc 


actatacctt 


aaacaaccgc 


tggagcaaca 


tagcgcctgg 


acctccaatg 


2400 


gcaacagctg 


gaccttcaga 


tggggacttc 


agcaacgccc 


agctcatctt 


ccctggacca 


2460 


tcagtcaccg 


gaaacacaac 


aacctcagca 


aacaatctgt 


tgtttacatc 


agaagaagaa 


2520 


attgctgcca 


ccaacccaag 


agacacggac 


atgtttggtc 


agattgctga 


caataatcag 


2580 


aatgctacaa 


ctgctcccat 


aaccggcaac 


gtgactgcta 


tgggagtgct 


tcctggcatg 


2640 
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gtgtggcaaa 


acagagacat 


ttactaccaa 


gggccaattt 


gggccaagat 


cccacacgcg 


2700 


gacggacatt 


ttcatccttc 


accgctaatt 


ggcggttttg 


gactgaaaca 


tccgcctccc 


2760 


cagatattta 


tcaaaaacac 


ccccgtacct 


gccaatcctg 


cgacaacctt 


cactgcagcc 


2820 


agagtggact 


ctttcatcac 


acaatacagc 


accggccagg 


tcgctgttca 


gattgaatgg 


2880 


gaaatcgaaa 


aggaacgctc 


caaacgctgg 


aatcctgaag 


tgcagtttac 


ttcaaactat 


2940 


gggaaccagt 


cttctatgtt 


gtgggctccc 


gatacaactg 


ggaagtatac 


agagccgcgg 


3000 


gttattggct 


ctcgttattt 


gactaatcat 


ttgtaactgc 


ctagttaatc 


aataaaccgt 


3060 


gtgattcgtt 


tcagttgaac 


tttggtctct 


gcgaagggcg 


aattc 




3105 



<210> 20 

<211> 3105 

20 <212> DNA 

<213> new AAV serotype, clone C3 

<400> 20 





gaattcgccc 


ttgctgcgtc 


aactggacca 


atgagaactt 


tcccttcaac 


gattgcgtcg 


60 


25 


acaagatggt 


gatctggtgg 


gaggagggca 


agatgaccgc 


caaggtcgtg 


gagtccgcca 


120 




aggccattct 


gggcggaagc 


aaggtgcgcg 


tggaccaaaa 


gtgcaagtca 


tcggcccaga 


180 




tcgaccccac 


gcccgtgatc 


gtcacctcca 


acaccaacat 


gtgcgccgtg 


atcgacggga 


240 


30 


acagcaccac 


cttcgagcac 


cagcagccgc 


tgcaggaccg 


catgttcaag 


ttcgagctca 


300 




cccgccgtct 


ggagcacgac 


tttggcaagg 


tgaccaagca 


ggaagtcaaa 


gagttcttcc 


360 




gctgggctca 


ggatcacgtg 


actgaggtgg 


cgcatgagtt 


ctacgtcaga 


aagggcggag 


420 


35 


ccaccaaaag 


acccgccccc 


agtgacgcgg 


atataagcga 


gcccaagcgg 


gcctgcccct 


480 




cagttgcgga 


gccatcgacg 


tcagacgcgg 


aagcaccggt 


ggactttgcg 


gacaggtacc 


540 




aaaacaaatg 


ttctcgtcac 


gcgggcatgc 


ttcagatgct 


gtttccctgc 


aagacatgcg 


600 


40 


agagaatgaa 


tcagaatttc 


aacgtctgct 


tcacgcacgg 


ggt cagagac 


tgctcagagt 


660 




gcttccccgg 


cgcgtcagaa 


tctcaacccg 


tcgtcagaaa 


aaagacgtat 


cagaaactgt 


720 




gcgcgattca 


tcatctgctg 


gggcgggcac 


ccgagattgc 


gtgttcggcc 


tgcgatctcg 


780 


45 


tcaacgtgga 


cttggatgac 


tgtgtttctg 


agcaataaat 


gacttaaacc 


aggtatggct 


840 




gctgacggtt 


atcttccaga 


ttggctcgag 


gacaacctct 


ctgagggcat 


tcgcgagtgg 


900 




tgggacctga 


aacctggagc 


ccccaagctc 


aaggccaacc 


agcagaagca 


ggacgacggc 


960 


50 


cggggtctgg 


tgcttcctgg 


ctacaagtac 


ctcggaccct 


tccacggact 


cgacaagggg 


1020 




gagcccgtca 


acgcggcgga 


cgcagcggcc 


ct cgagcacg 


acaaggcct a 


cgaccagcag 


1080 




ctcaaagcgg 


gtgacaatcc 


gtacctgcgg 


tataaccacg 


ccgacgccga 


gtttcaggag 


1140 


55 


cgtctgcaag 


aagatacgtc 


ttttgggggc 


aacctcgggc 


gagcagtctt 


ccaggccaag 


1200 
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aagagggtac 


tcgaaccact 


gggcctggtt 


gaagaaggtg 


ctaagacggc 


tcctggaaag 


1260 


aagagaccgt 


tagagtcacc 


acaagagccc 


gactcctcct 


caggaatcgg 


caaaaaaggc 


1320 


aaacaaccag 


ccaaaaagag 


actcaacttt 


gaagaggaca 


ctggagccgg 


agacggaccc 


1380 


cctgaaggat 


cagataccag 


cgccatgtct 


tcagacattg 


aaatgcgtgc 


agcaccgggc 


1440 


ggaaatgctg 


tcgatgcggg 


acaaggttcc 


gatggagtgg 


gtaatgcctc 


gggtgattgg 


1500 


cattgcgatt 


ccacctggtc 


tgagggcaag 


gtcacaacaa 


cctcgaccag 


aacctgggtc 


1560 


ttgcccacct 


acaacaacca 


cttgtacctg 


cggctcggaa 


caacatcaaa 


cagcaacacc 


1620 


tacaacggat 


tctccacccc 


ctggggatac 


tttgacttta 


acagattcca 


ctgtcacttc 


1680 


tcaccacgtg 


actggcaaag 


actcatcaac 


aacaactggg 


gactacgacc 


aaaagccatg 


1740 


cgcgttaaaa 


tcttcaatat 


ccaagttaag 


gaggtcacaa 


cgtcgaacgg 


cgagactacg 


1800 


gtcgctaata 


accttaccag 


cacggttcag 


atatttgcgg 


actcgtcgta 


tgagctcccg 


1860 


tacgtgatgg 


acgctggaca 


agagggaagt 


ctgcctcctt 


tccccaatga 


cgtcttcatg 


1920 


gtgcctcaat 


atggctactg 


tggcattgtg 


actggcgaaa 


atcagaacca 


gacggacaga 


1980 


aatgctttct 


actgcctgga 


gtattttcct 


tcacaaatgc 


tgagaactgg 


caataacttt 


2040 


gaaatggctt 


acaactttga 


gaaggtgccg 


ttccactcaa 


tgtatgctca 


cagccagagc 


2100 


ctggacagac 


tgatgaatcc 


cctcctggac 


cagtacctgt 


ggcacttaca 


gtcgaccacc 


2160 


tctggagaga 


ctctgaatca 


aggcaatgca 


gcaaccacat 


ttggaaaaat 


caggagtgga 


2220 


gactttgcct 


tttacagaaa 


gaactggctg 


cctgggcctt 


gtgttaaaca 


gcagagattc 


2280 


tcaaaaactg 


ccagtcaaaa 


ttacaagatt 


cctgccagcg 


ggggcaacgc 


tctgttaaag 


2340 


tatgacaccc 


actatacctt 


aaacaaccgc 


tggagcaaca 


tagcgcctgg 


acctccaatg 


2400 


gcaacagctg 


gaccttcaga 


tggggacttc 


agcaacgccc 


agctcatctt 


ccctggacca 


2460 


tcagtcaccg 


gaaacacaac 


aacctcagca 


aacaatctgt 


tgtttacatc 


agaaggagaa 


2520 


attgctgcca 


ccaacccaag 


agacacggac 


atgtttggtc 


agattgctga 


caataatcag 


2580 


aatgctacaa 


ctgctcccat 


aaccggcaac 


gtgactgcta 


tgggagtgct 


tcctggcatg 


2640 


gtgtggcaaa 


acagagacat 


ttactaccaa 


gggccaattt 


gggccaagat 


cccacacgcg 


27C0 


gacggacatt 


ttcatccttc 


accgctaatt 


ggcggttttg 


gactgaaaca 


tccgcctccc 


2760 


cagatattta 


tcaaaaacac 


ccccgtacct 


gccaatcctg 


cgacaacctt 


cactgcagcc 


2820 


agagtggact 


ctttcatcac 


acaatacagc 


accggccagg 


tcgctgttca 


gattgaatgg 


2880 


gaaatcgaaa 


aggaacgctc 


caaacgccgg 


aatcctgaag 


tgcagtttac 


ttcaaactat 


2940 


gggaaccagt 


cttctatgtt 


gtgggctccc 


gatacaactg 


ggaagtatac 


agagccgcgg 


3000 


gttattggct 


ctcgttattt 


gactaatcat 


ttgtaactgc 


ctagttaatc 


aataaaccgt 


3060 


gtgattcgtt 


tcagttgaac 


tttggtctct 


gcgaagggcg 


aattc 




3105 
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<210> 21 

<211> 3105 

<212> DNA 

<213> new AAV serotype, clone C5 

<400> 21 





gaattcgccc 


ttcgcagaga 


ccaaagttca 


actgaaacga 


atcacacggt 


ttattgatta 


60 


10 


actaggcagt 


tacaaatgat 


tagtcaaata 


acgagagcca 


ataacccgcg 


gctctgtata 


120 




cttcccagtt 


gtatcgggag 


cccacaacat 


agaagactgg 


ttcccacagt 


ttgaagtaaa 


180 




ctgcacttca 


ggattccagc 


gtttggagcg 


ttccttttcg 


atttcccatt 


caatctgaac 


240 


15 


agcgacctgg 


ccggtgctgt 


attgtgtgat 


gaaagagtcc 


actctggctg 


cagtgaaggt 


300 




tgtcgcagga 


taggcaggta 


cgggggtgtt 


tttgataaat 


atctggggag 


gcggatgttt 


360 




cagtccaaaa 


ccgccaatta 


gcggtgaagg 


atgaaaatgt 


ccgtccgcgt 


gtgggatctt 


420 


20 


ggcccaaatt 


ggcccttggt 


agtaaatgtc 


tctgttttgc 


cacaccatgc 


caggaagcac 


480 




tcccatagca 


gtcacgttgc 


cggttatggg 


agcagttgta 


gcattctgat 


tattgtcagc 


540 




aatctgacca 


aacatgtccg 


tgtctcttgg 


gttggtggca 


gcaatttctt 


cttctgatgt 


600 




aaacaacaga 


ttgtttgctg 


aggttgttgt 


gtttccggtg 


actgatggtc 


cagggaagat 


660 




gagctgggcg 


ttgctgaagt 


ccccatctga 


aggtccagct 


gttgccattg 


gaggtccagg 


720 




cgctatgttg 


ctccagcggt 


tgtttaaggt 


atagtgggtg 


tcatacttta 


acagagcgtt 


780 


30 


gcccccgctg 


gcaggaatct 


tgtaattttg 


actggcagtt 


tttgagaatc 


tctgctgttt 


840 




aacacaaggc 


ccaggcagcc 


agttctttct 


gtaaaaggca 


aagtctccac 


tcctgatttt 


900 




tccaaatgtg 


gttgctgcat 


tgccttgatt 


cagagtctct 


ccagaggtgg 


tcgactgtaa 


960 


35 


gtgccacagg 


tactggtcca 


ggaggggatt 


catcagtccg 


tccaggctct 


ggctgtgagc 


1020 




atacattgag 


tggaacggca 


ccttctcaaa 


gttgtaagcc 


gtttcaaagt 


tattgccagt 


1080 




tctcagcatt 


tgtgaaggaa 


aatactccag 


gcagtagaaa 


gcatttctgt 


ccgtctggtt 


1140 


40 


ctgattttcg 


ccagtcacaa 


tgccacagta 


gccatattga 


ggcaccatga 


agacgtcatt 


1200 




ggggaaagga 


ggcagacttc 


cctcttgtcc 


agcgtccatc 


acgtacggga 


gctcatacga 


1260 






ctd L>ci ud 




aagguL,a\-ua 


gcgaccg tag 


tctcgccgtt 


± J z u 


45 


cgacgttgtg 


acctccttaa 


cttggatatt 


gaagatttta 


acgcgcatgg 


cttttggtcg 


1380 




tagtccccag 


ttgttgttga 


tgagtctttg 


ccagtcacgt 


ggtgagaagt 


gacagtggaa 


1440 




tctgttaaag 


tcaaagtatc 


cccagggggt 


ggagaatccg 


ttgtaggtgt 


tgctgtttga 


1500 


50 


tgttgttccg 


agccgcaggt 


acaagtggtt 


gttgtaggtg 


ggcaagaccc 


aggttctggt 


1560 




cgaggttgtt 


gtgaccttgc 


cctcagacca 


ggtggaatcg 


caatgccaat 


cacccgaggc 


1620 




attacccact 


ccatcggaac 


cttgtcccgc 


atcgacagca 


tttccgcccg 


gtgctgcacg 


1680 


55 


catttcaatg 


tctgaagaca 


tggcgctggt 


atctgatcct 


tcagggggtc 


cgtctccggc 


1740 
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tccagtgtcc 


tcttcaaagt 


tgagtctctt 


tttggctggt 


tgtttgcctt 


ttttgccgat 


1800 


5 


tcctgaggag 


gagtcgggct 


cttgtggtga 


ctctaacggt 


ctcttctttc 


caggagccgt 


1850 




cttagcacct 


tcttcaacca 


ggcccagagg 


ttcgagtacc 


ctcttcttgg 


cctggaagac 


1920 




tgctcgcccg 


aggttgcccc 


caaaagacgt 


atcttcttgc 


agacgctcct 


gaaactcggc 


1980 


10 


gtcggcgtgg 


ttataccgca 


ggtacggatt 


gtcacccgct 


ttgagctgct 


ggtcqtaqac 


204 0 




cttgtcgtgc 


tcgagggccg 


ctgcgtccgc 


cgcgttgacg 


ggctccccct 


tgtcgagtcc 


2100 




gttgaagggt 


ccgaggtact 


cgtagccagg 


aagcaccaga 


ccccggccgt 


cgtcctgctt 


2160 


15 


ctgctggttg 


gccttgggct 


"tgqgggctcc 


aggtttcagg 


tcccaccact 


cgcgaatgcc 


2220 




ctcagagagg 


ttgtcctcga 


gccaatctgg 


aagataaccg 


tcagcagcca 


tacctggttt 


2280 




aagtcattta 


ttgctcagaa 


acacagtcat 


ccaagtccac 


gttgacgaga 


tcgcaggccg 


2340 


20 


aacacgcaat 


ctcgggtgcc 


cgccccagca 


gatgatgaat 


cgcgcacagt 


ttctgatacg 


2400 




tcttttttct 


gacgacgggt 


tgagattctg 


acgcgccggg 


gaagcactct 


gagcagtctc 


2460 




tgaccccgtg 


cgtgaagcag 


acgttgaaat 


tctgattcat 


tctctcgcat 


gtcttgcagg 


2520 


25 


gaaacagcat 


ctgaagcatg 


cccgcgtgac 


gagaacattt 


gttttggtac 


ctgtccgcaa 


2580 




ggtccaccgg 


tgcttccgcg 


tctgacgtcg 


atggctccgc 


aactaaggag 


caggcccgct 


2640 




tgggctcgct 


tatatccgcg 


tcactggggg 


cgggtctttt 


ggtggctccg 


ccctttctga 


2700 


30 


cgtagaactc 


atgcgccacc 


tcagtcacgt 


gatcctgagc 


ccagcggaag 


aactctttga 


2760 




cttcctgctt 


ggtcaccttg 


ccaaagtcgt 


gctccagacg 


gcgggtgagc 


tcgaacttga 


2820 




acatgcggtc 


ctgcagcggc 


tgctggtgct 


cgaaggtggt 


gctgttcccg 


tcgatcacgg 


2880 


35 


cgcacatgtt 


ggtgttggag 


gtgacgatca 


cgggcgtggg 


gtcgatctgg 


gccgatgact 


2940 




tgcacttttg 


gtccacgcgc 


accttgcttc 


cgcccagaat 


ggccttggcg 


gactccacga 


3000 




ccttggcggt 


catcttgccc 


tcctcccacc 


agatcaccat 


cttgtcgacg 


caatcgttga 


3060 


40 


agggaaagtt 


ctcattggtc 


cagttgacgc 


agcaagggcg 


aattc 




3105 


45 


<210> 22 
<211> 309^ 
<212> DNA 
<213> new 


I 

AAV serotype, clone Fl 










<400> 22 
gaatt cgccc 


ttgctgcgtc 


aactggacca 


agagaacttt 


ccctticaacg 


at'"t~rrr'fTt*f*cfa 


60 


50 


caagatggtg 


atctggtggg 


aggagggcaa 


gatgacggcc 


aaggtcgtgg 


agtccgccaa 


12 0 




agccattctg 


ggcggaagca 


aggtgcgcgt 


cgaccaaaag 


tgcaagtcct 


cggcccagat 


180 




cgatcccacc 


cccgtgatcg 


tcacctccaa 


caccaacatg 


tgcgccgtga 


tcgacgggaa 


240 


55 


cagcaccacc 


ttcgagcacc 


agcagccgtt 


gcaggaccgg 


atgttcaaat 


ttgaactcac 


300 
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ccgccgtctg 


gaacacgact 


ttggcaaggt 


gaccaagcag 


gaagtcaaag 


agttcttccg 


360 


ctgggctagt 


gatcacgtga 


ctgaggtgac 


gcatgagttc 


tacgtcagaa 


agggcggagc 


420 


cagcaaaaga 


cccgcccccg 


atgacgcgga 


tataagcgag 


cccaagcggg 


cctgtccctc 


480 


agtcacggac 


ccatcgacgt 


cagacgcgga 


aggagctccg 


gtggactttg 


ccgacaggta 


540 


ccaaaacaaa 


tgttctcgtc 


acgcgggcat 


gcttcagatg 


ctgtttccct 


gcaaaacgtg 


600 


cgagagaatg 


aatcagaatt 


tcaacatttg 


cttcacgcac 


ggggtcagag 


actgtttaga 


660 


atgtttcccc 


ggcgtgtcag 


aatctcaacc 


ggtcgtcaga 


aaaaagacgt 


atcggaagct 


720 


gtgtgcgatt 


catcatctgc 


tggggcgggc 


acccgagatt 


gcttgctcgg 


cctgcgacct 


780 


ggtcaacgtg 


gacctggacg 


actgtgtttc 


tgagcaataa 


atgacttaaa 


ccgggtatgg 


340 


ctgccgatgg 


ttatcttcca 


gattggctcg 


aggacaacct 


ctctgagggc 


attcgcgagt 


900 


ggtgggacct 


gaaacctgga 


gccccgaaac 


ccaaagccaa 


ccagcaaaag 


caggacgacg 


960 


gccggggtct 


ggtgcttcct 


ggctacaagt 


acctcggacc 


cttcaacgga 


ctcgacaagg 


1020 


gggagcccgt 


caacgcggcg 


gacgcagcgg 


ccctcgagca 


cgacaaggcc 


tacgaccagc 


1080 


agctcaaagc 


gggtgacaat 


ccgtacctgc 


ggtataacca 


cgccgacgcc 


gagtttcagg 


1140 


agcgtctgca 


agaagatacg 


tcatttgggg 


gcaacctcgg 


gcgagcagtc 


ttccaggcca 


1200 


agaagcgggt 


tctcgaacct 


ctcggtctgg 


ttgaggaagg 


cgctaagacg 


gctcctggaa 


1260 


agaagagacc 


catagactct 


ccagactcct 


ccacgggcat 


cggcaaaaaa 


ggccagcagc 


1320 


ccgctaaaaa 


gaagctcaat 


tttggtcaga 


ctggcgactc 


agagtcagtc 


cccgaccctc 


1380 


aacctcttgg 


agaacctcca 


gcagcgccct 


ctagtgtggg 


atctggtaca 


atggctgcag 


1440 


gcggtggcgc 


accaatggca 


gacaataacg 


aaggtgccga 


cggagtgggt 


aatgcctcag 


1500 


gaaattggca 


ttgcgattcc 


acatggctgg 


gcgacagagt 


catcaccacc 


agcaccagaa 


1560 


cctgggccct 


ccccacctac 


aacaaccacc 


tctacaagca 


aatctccagc 


agcagctcag 


1620 


gagccaccaa 


tgacaaccac 


tacttcggct 


acagcacccc 


ctgggggtat 


tttgacttta 


1680 


acagattcca 


ctgccacttc 


tcaccacgtg 


actggcagcg 


actcatcaac 


aacaactggg 


1740 


gattccggcc 


caagaagctg 


cggttcaagc 


tcttcaacat 


ccaggtcaag 


gaggtcacaa 


1800 


cgaatgacgg 


cgtcacgacc 


atcgctaata 


accttaccag 


cacggttcag 


gtcttctcgg 


1860 


actcggaata 


ccagctgccg 


tacgtcctcg 


gctctgcgca 


ccagggctgc 


ctgcctccgt 


1920 


tcccggcgga 


cgtcttcatg 


attcctcagt 


acggctacct 


gactctgaac 


aacggcagcc 


1980 


aatcggtggg 


ccgttcctcc 


ttctactgcc 


tggaatattt 


cccctctcaa 


atgctgagaa 


2040 


cgggcaacaa 


ctttgagttc 


agttacagct 


tcgaggacgt 


gcctttccac 


agcagctacg 


2100 


cgcacagcca 


gagcctagac 


cggctgatga 


accctctcat 


cgaccagtac 


ctgtactacc 


2160 


tggcccggac 


ccagagcacc 


acgggttcca 


ccagggaact 


gcaatttcat 


caagctgggc 


2220 



86 



EP1 310 571 A2 



10 



15 



20 



ccaatactat 


ggccgagcag 


tcaaagaact 


ggctgcctgg 


accctgctat 


aggcaacagg 


2280 


gactgtcaaa 


gaacttggac 


tttaacaaca 


acagcaattt 


tgcctggact 


gctgccacta 


2340 


aatatcatct 


gaatggcaga 


aactctttga 


ccaatcctgg 


cattcccatg 


gcaaccaaca 


2400 


aggatgatga 


ggaccagttc 


tttcccatca 


acggggtact 


ggtttttggc 


aagacgggag 


2460 


ctgccaacaa 


aactacgctg 


gaaaacgttc 


tgatgaccag 


cgaggaggag 


atcaagacca 


2520 


ctaaccctgt 


ggctacagaa 


gaatacggtg 


tggtctccag 


caacctgcag 


ccgtctacag 


2580 


ccgggcctca 


atcacagact 


atcaacagcc 


agggagcact 


gcctggcatg 


gtctggcaga 


2640 


accgggacgt 


gtatctgcag 


ggtcccatct 


gggccaaaat 


tcctcacacg 


gatggcaact 


2700 


ttcacccgtc 


tcctctgatg 


ggcggttttg 


gactcaaaca 


cccgcctcca 


cagatcctga 


2760 


tcaaaaacac 


acctgtacct 


gctaatcctc 


cggaggtgtt 


tactcctgcc 


aagtttgcct 


2820 


ccttcatcac 


gcagtacagc 


accggacaag 


tcagcgtgga 


aatcgagtgg 


gagctgcaga 


2880 


aagaaaacag 


caagcgctgg 


aacccagaaa 


ttcagtatac 


ttccaattat 


gccaagtcta 


2940 


ataatgttga 


atttgctgtg 


aaccctgatg 


gtgtttatac 


tgagcctcgc 


cccattggca 


3000 


ctcgttacct 


cccccgtaat 


ctgtaattgc 


ttgttaatca 


ataaaccggt 


tgattcgttt 


3060 


cagttgaact 


ttggtctctg 


cgaagggcga 


attc 






3094 



<210> 23 

30 <211> 3095 

<212> DNA 

<213> new AAV serotype, clone F3 

<400> 23 



35 


gaattcgccc 


ttcgcagaga 


ccaaagttca 


actgaaacga 


atcaaccggt 


ttattgatta 


60 




acaagcaatt 


acagattacg 


ggtgaggtaa 


cgagtgccaa 


tggggcgagg 


ctcagtataa 


120 




acaccatcag 


ggttcacagc 


aaattcaaca 


ttattagact 


tggcataatt 


ggaagtatac 


180 


40 


tgaatttctg 


ggttccagcg 


cttgctgttt 


tctttctgca 


gctcccactc 


gatttccacg 


240 




ctgacttgtc 


cggtgctgta 


ctgcgtgatg 


aaggaggcaa 


acttggcagg 


agtaaacacc 


300 




tccggaggat 


tagcaggtac 


aggtgtgttt 


ttgatcagga 


tctgtggagg 


cgggtgtttg 


360 


45 


agtccaaaac 


cgcccatcag 


aggagacggg 


tgaaagttgc 


catccgtgtg 


aggaattttg 


420 




gcccagatgg 


gaccctgcag 


atacacgtcc 


cggttctgcc 


agaccatgcc 


aggcagtgct 


480 




ccctggctgt 


tgatagtctg 


tgattgaggc 


ccggctgtag 


acgactgcag 


gttgctggag 


540 


50 


accacaccgt 


attcttctgt 


agccacaggg 


ttagtggtct 


tgatctcctc 


ctcgctggtc 


600 




atcagaacgt 


tttccagcgt 


agttttgttg 


gcagctcccg 


tcttgccaaa 


aaccagtacc 


660 




ccgttgatgg 


gaaagaactg 


gtcctcatca 


tccttgttgg 


ttgccatggg 


aatgccagga 


720 


55 


ttggtcaaag 


agtttctgcc 


attcagatga 


tatttagtgg 


cagcagtcca 


ggcaaaattg 


780 
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ctgttgttgt 


taaagtccaa 


gttctttgac 


agtctctgtt 


gcctatagca 


gggtccaggc 


840 


agccagttct 


ttgactgctc 


ggccatagta 


ttgggcccag 


cttgatgaaa 


ttgcagttcc 


900 


ctggtggaac 


ccgtggtgct 


ctgggtccgg 


gccaggtagt 


acaggtactg 


gtcgatgaga 


960 


gggttcatca 


gccggtctag 


gctctggctg 


tgcgcgtagc 


tgctgtggaa 


aggcacgtcc 


1020 


tcgaagctgt 


aactgaactc 


aaagttgttg 


cccgttctca 


gcatttgaga 


ggggaaatat 


1080 


tccaggcagt 


agaaggagga 


acggcccacc 


gattggctgc 


cgttgtccag 


agtcaggtag 


1140 


ccgtactgag 


gaatcatgaa 


gacgtccgcc 


gggaacggag 


gcaggcagcc 


ctggtgcgca 


12C0 


gagccgagga 


cgtacggcag 


ctggtattcc 


gagtccgaga 


agacctgaac 


cgtgctggta 


1260 


aggttattag 


cgatggtcgt 


gacgccgtca 


ttcgttgtga 


cctccttgac 


ctggatgttg 


1320 


aggagcttga 


accgcagctt 


cttgggccgg 


aatccccagt 


tgttgttgat 


gagtcgctgc 


1380 


cagtcacgtg 


gtgagaagtg 


gcagtggaat 


ctgttaaagt 


caaaataccc 


ccagggggtg 


1440 


ctgtagccga 


agtagtggtt 


gtcattggtg 


gctcctgagc 


tgctgctgga 


gatttgcttg 


1500 


tagaggtggt 


tgttgtaggt 


ggggsgggcc 


caggttctgg 


tgctggtggt 


gatgactctg 


1560 


tcgcccagcc 


atgtggaatc 


gcaatgccaa 


tttcctgagg 


cattacccac 


tccgtcggca 


1620 


ccttcgttat 


tgtctgccat 


tggtgcgcca 


ccgcctgcag 


ccattgtacc 


agatcccaca 


1680 


ctagagggcg 


ctgctggagg 


ttctccaaga 


ggttgagggt 


cggggactga 


ctctgagtcg 


1740 


ccagtctgac 


caaaattgag 


cttcttttta 


gcgggctgct 


ggcctttttt 


gccgatgccc 


1800 


gtggaggagt 


ctggagagcc 


tatggctctc 


ttctttccag 


gagccgtctt 


agcgccttcc 


1860 


tcaaccagac 


cgagaggttc 


gagaacccgc 


ttcttggcct 


ggaagactgc 


tcgcccgagg 


1920 


ttgcccccaa 


atgacgtatc 


ttcttgcaga 


cgctcctgaa 


actcggcgtc 


ggcgtggtta 


1980 


taccgcaggt 


acggattgtc 


acccgctttg 


agctgctggt 


cgtaggcctt 


gtcgtgctcg 


2040 


^gggccgctg 


cgtccgccgc 


gttgacgggc 


tcccccttgt 


cgagtccgtt 


gaagggtccg 


2100 


aggtacttgt 


agccaggaag 


caccagaccc 


cggccgtcgt 


cctgcttttg 


ctggttggct 


2160 


ttgggtttcg 


gggctccagg 


tttcaggtcc 


caccactcgc 


gaatgccctc 


agagaggttg 


2220 


tcctcgagcc 


aatctggaag 


ataaccatcg 


gcagccatac 


ctggtttaag 


tcatttattg 


2280 


ctcagaaaca 


cagtcgtcca 


ggtccacgtt 


gaccaggtcg 


caggccgagc 


aagcaatctc 


2340 


gggtgcccgc 


cccagcagat 


gatgaatcgc 


acacagcttc 


cgatacgtct 


tttttctgac 


2400 


gaccggttga 


gattctgaca 


cgccggggaa 


acattctaaa 


cagtctctga 


ccccgtgcgt 


2460 


gaagcaaatg 


ttgaaattct 


gattcattct 


ctcgcacgtt 


ttgcagggaa 


acagcacctg 


2520 


aagcatgccc 


gcgtgacgag 


aacatttgtt 


ttggtacctg 


tcggcaaagt 


ccaccggagc 


2580 


tccttccgcg 


tctgacgtcg 


atgggtccgt 


gactgaggga 


cgggcccgct 


tgggctcgct 


2640 


tatatccgcg 


tcatcggggg 


cgggtctttt 


gctggctccg 


ccctttctga 


cgtagaactc 


2700 
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atgcgtcacc 


tcagtcacgt 


gatcactagc 


ccagcggaag 


aactctttga 


cttcctgctt 


2760 


tgtcaccttg 


ccaaagtcgt 


gttccagacg 


gcgggtgagt 


tcaaatttga 


acatccggtc 


282C 


ctgcaacggt 


tgctggtgct 


cgaaggtggt 


gctgttcccg 


tcgatcacgg 


cgcacatgtt 


288C 


ggtgttggag 


gtgacgatca 


cgggggtggg 


atcgatctgg 


gcggacgact 


tgcacttttg 


294C 


gtccacgcgc 


accttgctgc 


cgccgagaat 


ggccttggcg 


gactccacga 


ccttggccgt 


300C 


catcttgccc 


tcctcccacc 


agatcaccat 


cttgtcgacg 


caatcgttga 


agggaaagtt 


3060 


ctcattggtc 


cagttgacgc 


agcaagggcg 


aattc 






3095 



<210> 24 

<211> 3095 

<212> DNA 

<213> new AAV serotype, clone F5 

<400> 24 



20 


gaattcgccc 


ttcgcagaga 


ccaaagttca 


actgaaacga 


atcaaccggt 


ttattgatta 


60 




acaagcaatt 


acagattacg 


ggtgaggtaa 


cgagtgccaa 


tggggcgagg 


ctcagtataa 


120 




acaccatcag 


ggttcacagc 


aaattcaaca 


ttattagact 


tggcataatt 


ggaagtatac 


180 


25 


tgaatttctg 


ggttccagcg 


cttgctgttt 


tctttctgca 


gctcccactc 


gatttccacg 


240 




ctgacttgtc 


cggtgctgta 


ctgcgtgatg 


aaggaggcaa 


acttggcagg 


agtaaacacc 


300 




tccggaggat 


tagcaggtac 


aggtgtgttt 


ttgatcagga 


tctgtggagg 


cgggtgttcg 


360 


30 


agtccaaaac 


cgcccatcag 


aggagacggg 


tgaaagttgc 


catccgtgtg 


aggaattttg 


420 




gcccagatgg 




CL u d Cl ^ Ul^O 




agacc at gcc 


aggcagtgct 


480 




ccctggctgt 


tgatagtctg 


tgattgaggc 


ccggctgtag 


acgactgcag 


gttgctggag 


540 


35 


accacaccgt 


attcttctgt 


agccacaggg 


ttagtggtct 


tgatctcctc 


ctcgctggtc 


600 




atcagaacgt 


tttccagcgt 


agttttgttg 


gcagctcccg 


tcttgccaaa 


aaccagtacc 


660 




ccgttgatgg 


gaaagaactg 


gtcct catca 


tccttgttgg 


ttgccatggg 


aatgccagga 


720 


40 


ttggtcaaag 


agtttctgcc 


attcagatga 


tatttagtgg 


cagcagtcca 


ggcaaaattg 


780 




ctgttgttgt 


taaagtccaa 


gttctttgac 


agtctctgtt 


gcctatagca 


gggtccaggc 


840 




agccagttct 


ttgactgctc 


ggccatagta 


ttgggcccag 


cttgatgaaa 


ttgcagttcc 


900 


45 


ctggtggaac 


ccgtggtgct 


ctgggtccgg 


gccaggtagt 


acaggtactg 


gtcgatgaga 


960 




gggttcatca 


gccggtctag 


gctctggctg 


tgcgcgtagc 


tgctgtggaa 


aggcacgtcc 


1020 




tcgaagctgt 


aactgaactc 


aaagttgttg 


cccgttctca 


gcatttgaga 


ggggaaatat 


1080 


50 


tccaggcagt 


agaaggagga 


acggcccacc 


gattggctgc 


cgttgttcag 


agtcaggtag 


1140 




ccgtactgag 


gaatcatgaa 


gacgtccgcc 


gggaacggag 


gcaggcagcc 


ctggtgcgca 


1200 




gagccgagga 


cgtacggcag 


ctggtattcc 


gagtccgaga 


agacctgaac 


cgtgctggta 


1260 


55 


aggttattag 


cgatggtcgt 


gacgccgtca 


ttcgttgtga 


cctccttgac 


ctggatgttg 


1320 
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accgcagctt 


cttigggccgg 


aacccoL^dyu 


v;yutyci-ya.L- 


gagt cgc tgc 


138 0 


5 




gt gagaagtg 


gcagtcfgaat 


c u y t L a. aa y u 


caaaataccc 


ccagggggtg 


T /I /I n 




c t g t a. gc c ga 
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25 
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ctggttggct 


2 XdU 




ttgggtttcg 


gggctccagg 


ttt caggtcc 


caccactcgc 


gaatgccct c 


agagaggttg 
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45 


ctgcaacggc 


tgctggtgct 


cgaaggtggt 


gctgttcccg 


tcgatcacgg 


cgcgcatgtt 


2880 




ggtgttggag 


gtgacgatca 


cgggggtggg 


atcgatctgg 


gcggacgact 


tgcacttttg 


2940 


50 


gtccacgcgc 


accttgctgc 


cgccgagaat 


ggccttggcg 


gactccacga 


ccttggccgt 


3000 


catcttgccc 


tcctcccacc 


agatcaccat 


cttgtcgacg 


caatcgttga 


agggaaagtt 


3060 




ctcattggtc 


cagttgacgc 


agcaagggcg 


aattc 






3095 
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<210> 25 

<211> 3142 

<212> DNA 

<213> new AAV serotype, clone H6 

<400> 25 





aaaacgacgg 


gccagtgatt 


gtaatacgac 


tcactatagg 


gcgaaattga 


aattagcggc 


60 


10 


cgcgaattcg 


cctttcgcag 


agaccaaagt 


tcaactgaaa 


cgaattaaac 


ggtttattga 


120 


t taacaagca 


attacagatt 


acgagtcagg 


tatctggtgc 


caatggggcg 


aggctctgaa 


180 




tacacaccat 


tagtgtccac 


agtaaagtcc 


acattaacag 


acttgttgta 


gttggaagtg 


240 


15 


tactgaattt 


cgggattcca 


gcgtttgctg 


ttctccttct 


gcagctccca 


ctcgatctcc 


300 


acgctgacct 


gtcccgtgga 


atactgtgtg 


atgaaagaag 


caaacttggc 


agaactgaag 


360 




tttgtgggag 


gattggctgg 


aacgggagtg 


tttttgatca 


tgatctgagg 


aggcgggtgt 


420 


20 


ttgagtccaa 


aacctcccat 


cagtggagaa 


ggatgaaagt 


gtccatcggt 


gtgaggaatc 


480 


ttggcccaaa 


tgggtccctg 


caggtacacg 


tctcgatcct 


gccacaccat 


accaggtaac 


540 




gctccttggt 


gattgacagt 


tccagtagtt 


ggaccagtgt 


ttgagttttg 


caaattattt 


600 


25 


gacacagtcc 


cgtactgctc 


cgtagccacg 


ggattggtgg 


ccctgatttc 


ttcttcatct 


660 




gtaatcatga 


cattttccaa 


atccgcgtcg 


ttggcatttg 


ttccttgttt 


accaaatatc 


720 




agggttccat 


gcatggggaa 


aaacttttct 


tcgtcatcct 


tgtgactggc 


catagctggt 


780 


30 


cctggattaa 


ccaacgagtc 


ccggccattt 


agatgatact 


ttgtagctgc 


agtccaggga 


840 




aagttgctgt 


tgttgttgtc 


gtttgcctgt 


tttgacagac 


gctgctgtct 


gtagcaaggt 


900 




ccaggcagcc 


agtttttagc 


ttgaagagac 


atgttggttg 


gtccagcttg 


gctaaacagt 


960 


35 


agccgagact 


gctgaagagt 


tccactattt 


gtttgtgtct 


tgttcagata 


atacaggtac 


1020 




tggtcgatca 


gaggattcat 


cagccgatcc 


agactctggc 


tgtgagcgta 


gctgctgtgg 


1080 




aaaggcacgt 


cttcaaaagt 


gtagctgaac 


tgaaagttgt 


ttccagtacg 


cagcatctga 


1140 


40 


gaaggaaagt 


actccaggca 


gtaaaaggaa 


gagcgtccta 


ccgcctgact 


cccgttgttc 


1200 




agggtgaggt 


atccatactg 


tgggaccatg 


aagacgtccg 


ctggaaacgg 


cgggaggcat 


1260 




ccttgatgcg 


ccgagcccag 


gacgtacggg 


agctggtact 


ccgagtcagt 


aaacacctga 


1320 


45 


accgtgctgg 


taaggttatt 


ggcaatcgtc 


gtcgtaccgt 


cattctgcgt 


gacctctttg 


1380 




acttgaatat 


taaagagctt 


gaagttgagt 


cttttgggcc 


ggaatccccg 


gttgttgttg 


1440 




acgagtcttt 


gccagtcacg 


tggtgaaaag 


tggcagtgga 


atctgttgaa 


gtcaaaatac 


1500 


50 


ccccaggggg 


tgctgtagcc 


aaagtagtgg 


ttgtcgttgc 


tggctcctga 


ttggctggag 


1560 




atttgcttgt 


agaggtggtt 


gttgtatgtg 


ggcagggccc 


aggttcgggt 


gctggtggtg 


1620 




atgactctgt 


cgcccagcca 


ttgggaatcg 


caatgccaat 


ttcctgagga 


attacccact 


1680 


55 


ccatcggcac 


cctcgttatt 


gtctgccatt 


ggtgcgccac 


tgcctgtagc 


cattgtagta 


1740 
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gatcccagac 


cagagggggc 


tgctggtggc 


tgtccgagag 


gctgggggtc 


aggtacggag 


1800 


tctgcgtctc 


cagtctgacc 


aaaatttaat 


ctttttcttg 


caggctgctg 


gcccgctttt 


1860 


ccggttcccg 


aggaggagtc 


tggctccaca 


ggagagtgct 


ctaccggcct 


cttttttccc 


1920 


ggagccgtct 


taacaggctc 


ctcaaccagg 


cccagaggtt 


caagaaccct 


ctttttcgcc 


1980 


tggaagactg 


ctcgtccgag 


gttgccccca 


aaagacgtat 


cttctttaag 


gcgctcctga 


2040 


aactctgcgt 


cggcgtggtt 


gtacttgagg 


tacgggttgt 


ctccgctgtc 


gagctgccgg 


2100 


tcgtaggcct 


tgtcgtgctc 


gagggccgcg 


gcgtctgcct 


cgttgaccgg 


ctcccccttg 


2160 


tcgagtccgt 


tgaagggtcc 


gaggtacttg 


tacccaggaa 


gcacaagacc 


cctgctgtcg 


2220 


tccttatgcc 


gctctgcggg 


ctttggtggt 


ggtgggccag 


gtttgagctt 


ccaccactgt 


2280 


cttattcctt 


cagagagagt 


gtcctcgagc 


caatctggaa 


gataaccatc 


ggcagccata 


2340 


cctgatttaa 


atcatttatt 


gttcagagat 


gcagtcatcc 


aaatccacat 


tgaccagatc 


2400 


gcaggcagtg 


caagcgtctg 


gcacctttcc 


catgatatga 


tgaatgtagc 


acagtttctg 


2460 


atacgccttt 


ttgacgacag 


aaacgggttg 


agattctgac 


acgggaaagc 


actctaaaca 


2520 


gtctttctgt 


ccgtgagtga 


agcagatatt 


tgaattctga 


ttcattctct 


cgcattgtct 


2580 


gcagggaaac 


agcatcagat 


tcatgcccac 


gtgacgagaa 


catttgtttt 


ggtacctgtc 


2640 


cgcgtagttg 


atcgaagctt 


ccgcgtctga 


cgtcgatggc 


tgcgcaactg 


actcgcgcgc 


2700 


ccgtttgggc 


tcacttatat 


ctgcgtcact 


gggggcgggt 


cttttcttag 


ctccaccctt 


2760 


tttgacgtag 


aattcatgct 


ccacctcaac 


cacgtgatcc 


tttgcccacc 


ggaaaaagtc 


2820 


tttcacttcc 


tgcttggtga 


cctttccaaa 


gtcatgatcc 


agacggcggg 


taagttcaaa 


2880 


tttgaacatc 


cggtcttgca 


acggctgctg 


gtgctcgaag 


gtcgttgagt 


tcccgtcaat 


2940 


cacggcgcac 


atgttggtgt 


tggaggtgac 


gatcacggga 


gtcgggtcta 


tctgggccga 


3000 


ggacttgcat 


ttctggtcca 


cacgcacctt 


gcttcctcca 


agaatggctt 


tggccgactc 


3060 


cacgaccttg 


gcggtcatct 


tcccctcctc 


ccaccagatc 


accatcttgt 


cgacgcaatg 


3120 


gtaaaaggaa 


agttctcatt 


gg 








3142 



<210> 26 
<211> 3075 
<212> DNA 

<213> new AAV serotype, clone H2 
<400> 26 

tgagaacttt cctttcaacg attgcgtcgg acaagatggt gatctggtgg gaggagggga 60 
agatgaccgc caaggtcgtg gagtcggcca aagccattct tggaggaagc aaggtgcgtg 120 
tggaccagaa atgcaagtcc tcggcccaga tagacccgac tcccgtgatc gtcacctcca 180 
acaccaacat gtgcgccgtg attgacggga actcaacgac cttcgagcac cagcagccgt 24 0 
tgcaagaccg gatgttcaaa tttgaactta cccgccgtct ggatcatgac tttggaaagg 300 
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1440 


35 


ca.a.ts.scg'sg" 


ggtgccgatg 


gagtgggtaa 


1 1 cctcagga 


aatt ggcat t 


gcgattccca 


1500 




stcrcfctcfcrcfc 


gacagagt ca 


tcaccaccag 


cacccgaacc 


t gggccctgc 


ccacatacaa 


1560 




caaccacctc 


t acaagcaaa 


tctccagcca 


atcaggagcc 


agcaacgaca 


ac cact ac tt 


1620 


40 


t ggct acagc 


accccctggg 


ggt attt tga 


cttcaacaga 


ttccact gcc 


ac ttttcacc 


1680 




arrftTraflTm 


caaagactca 


t caacaacaa 


ct ggggattc 


cggcccaaaa 


gactcaactt 


1740 






aatzati't caag 


tcaaagaggt 


cacgcagaat 


gacggtacga 


cgacgattgc 


1800 


45 


csia. taa.cc 1 1 


accagcacgg 


ttcaggtgtt 


tactgactcg 


gagtaccagc 


tcccgtacgt 


1860 




r'r"+*rrffrrr'tT"(T 


gcgcaticaag 


gat gcc tccc 


gccgtttcca 


gcggacgtct 


tcatggtccc 


1920 




acagtatgga 


tacctcaccc 


tgaacaacgg 


gagtcaggcg 


gtaggacgct 


cttcctttta 


1980 


50 


ctgcctggag 


tactttcctt 


ctcagatgct 


gcgtactgga 


aacaactttc 


agttcagcta 


2040 




cacttttgaa 


gacgtgcctt 


tccacagcag 


ctacgctcac 


agccagagtc 


tggatcggct 


2100 


55 


gatgaatcct 


ctgatcgacc 


agtacctgta 


ttatctgaac 


aagacacaaa 


caaatagtgg 


2160 


aactcttcag 


cagtctcggc 


tactgtttag 


ccaagctgga 


ccaaccaaca 


tgtctcttca 


2220 
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agctaaaaac 


tggctgcctg 


gaccttgcta 


cagacagcag 


cgtctgl: caa 


aacaggcaaa 


228 0 


5 


cgacaacaac 


aacagcaact 


ttccctggac 


tgcagctaca 


aagtatcatc 


taaatggccg 


2340 




ggactcgttg 


gttaatccag 


gaccagctat 


ggccagtcac 


aaggatgacg 


aagaaaagtt 


2400 




tttccccatg 


catggaaccc 


tgatatttgg 


taaacaagga 


acaaatgcca 


acgacgcgga 


2460 


10 


tttggaaaat 


gtcatgatta 


cagatgaaga 


agaaatcagg 


gccaccaatc 


ccgtggctac 


2520 




ggagcagtac 


gggactgtgt 


caaataattt 


gcaaaactca 


aacactggtc 


caactactgg 


2580 




aactgtcaat 


cgccaaggag 


cgttacctgg 


tatggtgtgg 


caggatcgag 


acgtgtacct 


2 640 


15 


gcagggaccc 


atttgggcca 


agattcctca 


caccgatgga 


cactttcatc 


cttctccact 


2700 




gatgggaggt 


tttggactca 


aacacccgcc 


tcctcagatc 


atgatcaaaa 


acactcccgt 


2760 




tccagccaat 


cctcccacaa 


acttcagttc 


tgccaagttt 


gcttctttca 


tcacacagta 


2820 


20 


ttccacggga 


caggtcagcg 


tggagatcga 


gtgggagctg 


cagaaggaga 


acagcaaacg 


2880 




ctggaatccc 


gaaattcagt 


acacttccaa 


ctacaacaag 


tctgttaatg 


tggactttac 


2940 




tgtggacact 


aatggtgtgt 


attcagagcc 


tcgccccatt 


ggcaccagat 


acctgactcg 


30C0 


25 


taatctgtaa 


ttgcttgtta 


atcaataaac 


cgtttaattc 


gtttcagttg 


aactttggtc 


3060 




tctgcgaagg 


gcgaa 










3075 



<210> 27 

30 <211> 3128 

<212> DNA 

<213> new AAV serotype, clone 42.8 

<400> 27 



35 


gaattcgccc 


tttctacggc 


tgcgtcaact 


ggaccaatga 


gaactttccc 


ttcaacgatt 


60 




gcgtcgacaa 


gatggtgatc 


tggtgcgagg 


agggcaagat 


gacggccaag 


gtcgtggagt 


120 




ccgccaaggc 


cattctcggc 


ggcagcaagg 


tgcgcgtgga 


ccaaaagtgc 


aagtcttccg 


180 


40 


cccagatcga 


tcccaccccc 


gtgatcgtca 


cttccaacac 


caacatgtgc 


gccgtgattg 


240 




acgggaacag 


caccaccttc 


gagcaccagc 


agccgttaca 


agaccggatg 


ttcaaatttg 


300 




aactcacccg 


ccgtctggag 


cacgactttg 


gcaaggtgac 


aaagcaggaa 


gtcaaagagt 


360 


45 


tcttccgctg 


ggcgcaggat 


cacgtgaccg 


aggtggcgca 


tgagttctac 


gtcagaaagg 


420 




gtggagccaa 


caagagaccc 


gcccccgatg 


acgcggataa 


aagcgagccc 


aagcgggcct 


480 




gcccctcagt 


cgcggatcca 


tcgacgtcag 


acgcggaagg 


agctccggt g 


gactttgccg 


540 


50 


acaggtacca 


aaacaaatgt 


tctcgtcacg 


cgggcatgct 


tcagatgctg 


tttccctgca 


600 




agacatgcga 


gagaatgaat 


cagaatttca 


acatttgctt 


cacgcacggg 


accagagact 


660 




gttcagaatg 


tttccccggc 


gtgtcagaat 


ctcaaccggt 


cgtcagaaag 


aggacgtatc 


720 


55 


ggaaactctg 


tgccattcat 


catctgctag 


ggcgggctcc 


cgagattgct 


tgctcggcct 


780 
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gcgatctggt caacgtggac ctggatgact gtgtttctga gcaataaatg acttaaacca 840 

ggtatggctg ccgatggtta tcttccagat tggctcgagg acaacctctc tgagggcatt 900 

5 

cgcgagtggt gggacttgaa acctggagcc ccgaaaccca aagccaacca gcaaaagcag 960 

gacgacggcc ggggtctggt gcttcctggc tacaagtacc tcggaccctt caacggactc 1020 

gacaaggggg agcccgtcaa cgcggcggac gcagcggccc tcgagcacga caaggcctac 1080 

gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataaccacgc cgacgccgag 1140 

tttcaggagc gtctgcaaga agatacgtct tttgggggca acctcgggcg agcagtcttc 1200 

caggccaaga agcgggttct cgaacctctc ggtctggttg aggaaggcgc taagacggct 1260 

cctggaaaga agagaccggt agagccatca ccccagcgtt ctccagactc ctctacgggc 132 0 

atcggcaaga caggccagca gcccgcgaaa aagagactca actttgggca gactggcgac 1380 

2Q tcagagtcag tgcccgaccc tcaaccaatc ggagaacccc ccgcaggccc ctctggtctg 1440 

ggatctggta caatggctgc aggcggtggc gctccaatgg cagacaataa cgaaggcgcc 1500 

gacggagtgg gtagttcctc aggaaattgg cattgcgatt ccacatggct gggcgacaga 1560 

25 gtcatcacca ccagcacccg aacctgggcc ctccccacct acaacaacca cctctacaag 1620 

caaatctcca acgggacatc gggaggaagc accaacgaca acacctactt cggctacagc 1680 

accccctggg ggtattttga ctttaacaga ttccactgcc acttctcacc acgtgactgg 1740 

30 cagcgactca tcaacaacaa ctggggattc cggcccaaga gactcaactt caagctcttc 1800 

aacatccagg tcaaggaggt cacgcagaat gaaggcacca agaccatcgc caataacctt 1860 

accagcacga ttcaggtctt tacggactcg gaataccagc tcccgtacgt cctcggctct 1920 

35 gcgcaccagg gctgcctgcc tccgttcccg gcggacgtct tcatgattcc tcagtacggg 1980 

tacctgactc tgaacaacgg cagtcaggcc gtgggccgtt cctccttcta ctgcctggag 2 04 0 

tactttcctt ctcaaatgct gagaacgggc aacaactttg agttcagcta ccagtttgag 2100 

40 gacgtgcctt ttcacagcag ctacgcgcac agccaaagcc tggaccggct gatgaacccc 2160 

ctcatcgacc agtacctgta ctacctgtct cggactcagt ccacgggagg taccgcagga 2220 

actcagcagt tgctattttc tcaggccggg cctaataaca tgtcggctca ggccaaaaac 2280 

tggctacccg ggccctgcta ccggcagcaa cgcgtctcca cgacactgtc gcaaaataac 2340 

aacagcaact ttgcttggac cggtgccacc aagtatcatc tgaatggcag agactctctg 2400 

gtaaatcccg gtgtcgctat ggcaacgcac aaggacgacg aagagcgatt ttttccatcc 2460 

50 agcggagtct tgatgtttgg gaaacaggga gctggaaaag acaacgtgga ctatagcagc 2 52 0 

gttatgctaa ccagtgagga agaaatcaaa accaccaacc cagtggccac agaacagtac 2580 

ggcgtggtgg ccgataacct gcaacagcaa aacgccgctc ctattgtagg ggccgtcaac 2640 

^5 agtcaaggag ccttacctgg catggtctgg cagaaccggg acgtgtacct gcagggtcct 2700 
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atctgggcca agattcctca cacggacggc aactttcatc cttcgccgct gatgggaggc 2760 

tttggactga aacacccgcc tcctcagatc ctgattaaga atacacctgt tcccgcggat 2820 

cctccaacta ccttcagtca agccaagctg gcgtcgttca tcacgcagta cagcaccgga 288 0 

caggtcagcg tggaaattga atgggagctg cagaaagaga acagcaagcg ctggaaccca 2940 

gagattcagt atacttccaa ctactacaaa tctacaaatg tggactttgc tgtcaatact 3000 

gagggtactt attcagagcc tcgccccatt ggcacccgtt acctcacccg taacctgtaa 3060 

ttgcctgtta atcaataaac cggctaattc gtttcagttg aactttggtc tctgcgaagg 3120 

gcgaattc 3128 

<210> 28 

<211> 3128 

<212> DMA 

<213> new AAV serotype, clone 42.15 

<400> 28 



gaat t cgccc 


tt tctacggc 


t gcgt caact 


ggaccaatga 


gaactttccc 


ttcaacgatt 


60 


gcgtcgacaa 


gatggtgatc 


tggtgggagg 


agggcaagat 


gacggccaag 


gtcgtggagt 


12 0 


ccgccaaggc 


cattctcggc 


ggcagcaagg 


tgcgcgtgga 


ccaaaagtgc 


aagtcgtccg 


180 


cccagat cga 


ccccaccccc 


gtgatcgtca 


cc tccaacac 


caacatg tgc 


gccgtgattg 


2 4 0 


acgggaacag 


caccaccttc 


gagcaccagc 


agccgt tgca 


ggaccggatg 


tt caaatttg 


300 


aactcacccg 


ccgtctggag 


catgactttg 


gcaaggtgac 


aaagcaggaa 


gtcaaagagt 


360 


tcttccgctg 


ggcgcaggat 


cacgtgaccg 


aggtggcgca 


tgagttctac 


gtcagaaagg 


420 


gtggagccaa 


caagagaccc 


gcccccgat g 


acgcggataa 


aagcgagccc 


aagcgggcct 


480 


gcccctcagt 


cgcggatcca 


tcgacgtcag 


acgcggaagg 


agctccggtg 


gactttgccg 


540 


acaggtacca 


aaacaaatgt 


tctcgtcacg 


cgggcatgct 


tcagatgctg 


tttccctgca 


600 


agacatgcga 


gagaatgaat 


cagaatttca 


acatttgctt 


cacgcgcggg 


accagagact 


660 


gttcagaatg 


tttcccgggc 


gtgtcagaat 


ctcaaccggt 


cgtcagaaag 


aggacgtatc 


720 


ggaaactctg 


tgccattcat 


catctgctgg 


ggcgggctcc 


cgagattgct 


tgctcggcct 


780 


gcgatctggt 


caacgtggac 


ctggatgact 


gtgtttctga 


gcaataaatg 


acttaaacca 


840 


ggtatggctg 


ccgatggtta 


tcttccagat 


tggctcgagg 


acaacctctc 


tgagggcatt 


900 


cgcgagtggt 


gggacttgaa 


acctggagcc 


ccgaaaccca 


aagccaacca 


gcaaaagcag 


960 


gacgacggcc 


ggggtctggt 


gcttcctggc 


tacaagtacc 


tcggaccctt 


caacggactc 


1020 


gacaaggggg 


agcccgtcaa 


cgcggcggac 


gcagcggccc 


tcgagcacga 


caaggcctac 


1080 


gaccagcagc 


tcaaagcggg 


tgacaatccg 


tacctgcggt 


ataaccacgc 


cgacgccgag 


1140 


tttcaggagc 


gtctgcaaga 


agatacgtct 


tttgggggca 


acctcgggcg 


agcagtcttc 


1200 


caggccaaga 


agcgggttct 


cgaacctctc 


ggtctggttg 


aggaaggcgc 


taagacggct 


1260 
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cctggaaaga 


agagaccggt 


agagccatca 


ccccagcgtt 


ctccagactc 


ctctacgggc 


1320 


atcggcaaga 


caggccagca 


gcccgcgaaa 


aagagactca 


actttgggca 


gactggcgac 


1380 


tcagagtcag 


tgcccgaccc 


tcaaccaatc 


ggagaacccc 


ccgcaggccc 


ctctggtctg 


1440 


ggatctggta 


caatggctgc 


aggcggtggc 


gctccaatgg 


cagacaataa 


cgaaggcgcc 


1500 


gacggagtgg 


gtagttcctc 


aggaaattgg 


cattgcgatt 


ccacatggct 


gggcgacaga 


1560 


gtcatcacca 


ccagcacccg 


aacctgggcc 


ctccccacct 


acaacaacca 


cctctacaag 


1620 


caaatctcca 


acgggacatc 


gggaggaagc 


accaacgaca 


acacctactt 


cggctacagc 


1680 


accccctggg 


ggtattttga 


ctttaacaga 


ttccactgcc 


acttctcacc 


acgtgactgg 


1740 


cagcgactca 


tcaacaacaa 


ctggggattc 


cggcccaaga 


gactcaactt 


caagctcttc 


1800 


aacatccagg 


tcaaggaggt 


cacgcagaat 


gaaggcacca 


agaccatcgc 


caataacctt 


1860 


accagcacga 


ttcaggtctt 


tacggactcg 


gaataccagc 


tcccgtacgt 


cctcggctct 


1920 


gcgcaccagg 


gctgcccgcc 


tccgttcccg 


gcggacgtct 


tcatgattcc 


tcagtacggg 


1980 


tacctgactc 


tgaacaacgg 


cagtcaggcc 


gtgggccgtt 


cctccttcta 


ctgcctggag 


2040 


tactttcctt 


ctcaaatgcg 


gagaacgggc 


aacaactttg 


agttcagcta 


ccagtttgag 


2100 


gacgtgcctt 


ttcacagcag 


ctacgcgcat 


agccaaagcc 


tggaccggct 


gatgaacccc 


2160 


ctcatcgacc 


agtacctgta 


ctacctgtct 


cggact cagt 


ccacgggagg 


taccgcagga 


2220 


actcagcagt 


tgctattttc 


tcaggccggg 


cctaataaca 


tgtcggctca 


ggccaaaaac 


2280 


tggctacccg 


ggccctgcta 


ccggcagcaa 


cgcgtctcca 


cgacactgtc 


gcaaaataac 


2340 


aacagcaact 


ttgcttggac 


cggtgccacc 


aagtatcatc 


tgaatggcag 


agac tctctg 


2400 


gtaaatcccg 


gtgtcgctat 


ggcaacgcac 


aaggacgacg 


aagagcgatt 


ttttccatcc 


2460 


agcggagtct 


tgatgtttgg 


gaaacaggga 


gctggaaaag 


acaacgtgga 


ctatagcagc 


2520 


gttatgctaa 


ccagtgagga 


agaaatcaaa 


accaccaacc 


cagtggccac 


agaacagtac 


2580 


ggcgtggtgg 


ccgataacct 


gcaacagcaa 


aacgccgctc 


ctattgtagg 


ggccgtcaac 


2640 


agtcaaggag 


ccttacctgg 


catggtctgg 


cagaaccggg 


acgtgtacct 


gcagggtcct 


2700 


atctgggcca 


agattcctca 


cacggacggc 


aactttcatc 


cttcgccgct 


gatgggaggc 


2760 


tttggactga 


aacacccgcc 


tcctcagatc 


ctgattaaga 


atacacctgt 


tcccgcggat 


2820 


cctccaacta 


ccttcagtca 


agccaagctg 


gcgtcgttca 


tcacgcagta 


cagcaccgga 


2880 


caggtcagcg 


tggaaattga 


atgggagctg 


cagaaagaga 


acagcaagcg 


ctggaaccca 


2940 


gagattcagt 


atacttccaa 


ctactacaaa 


tctacaaatg 


tggactttgc 


tgtcaatact 


3000 


gagggtactt 


attcagagcc 


tcgccccatt 


ggcacccgtt 


acctcacccg 


taacctgtaa 


3060 


ttgcctgtta 


atcaataaac 


cggttaattc 


gtttcagttg 


aactttggtc 


tctgcgaagg 


3120 


gcgaattc 
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<210> 29 

<211> 3197 

<212> DNA 

5 <213> new AAV serotype, clone 42.5b 

<400> 29 





gaattcgccc 


tttctacggc 


tgcgtcaact 


ggaccaatga 


gaactttccc 


ttcaacgatt 


60 


10 


gcgtcgacaa 


gatggtgatc 


tggtgggagg 


agggcaagat 


gacggccaag 


gtcgtggagt 


120 


ccgccaaggc 


cattctcggc 


ggcagcaagg 


tgcgcgtgga 


ccaaaagtgc 


aagtcgtccg 


180 




cccagatcga 


ccccaccccc 


gtgatcgtca 


cctccaacac 


caacatgtgc 


gccgtgattg 


240 


15 


acgggaacag 


caccaccttc 


gagcaccagc 


agccgttaca 


agaccggatg 


ttcaaatttg 


300 




aactcacccg 


ccgtctggag 


cacgactttg 


gcaaggtgac 


aaagcaggaa 


gtcaaagagt 


360 




tcttccgctg 


ggcgcaggat 


cacgtgaccg 


aggtggcgca 


tgagttctac 


gtcagaaagg 


420 


20 


gtggagccaa 


caagagaccc 


gcccccgatg 


acgcggataa 


aagcgagccc 


aagcgggcct 


480 




gcccctcagt 


cgcggatcca 


tcgacgtcag 


acgcggaagg 


agctccggtg 


gactttgccg 


540 




acaggtacca 


aaacaaatgt 


tctcgtcacg 


cgggcatgct 


tcagatgctg 


tttccctgca 


6C0 




agacatgcga 


gagaatgaat 


cagaatttca 


acatttgctt 


cacgcacggg 


accagagact 


660 




gttcagaatg 


tttccccggc 


gtgtcagaat 


ctcaaccggt 


cgtcagaaag 


aggacgtatc 


720 




ggaaactctg 


tgccattcat 


catctgctgg 


ggcgggctcc 


cgagattgct 


tgctcggcct 


780 


30 


gcgatctggt 


caacgtggac 


ctggatgact 


gtgtttctga 


gcaataaatg 


acttaaacca 


840 




ggtatggctg 


ccgatggtta 


tcttccagat 


tggctcgagg 


acaacctctc 


tgagggcatt 


900 




cgcgagtggt 


gggacttgaa 


acctggagcc 


ccgaaaccca 


aagccaacca 


gcaaaagcag 


960 


35 


gacgacggcc 


ggggtctggt 


gcttcctggc 


tacaagtacc 


tcggaccctt 


caacggactc 


1020 




gacaagggag 


agccggtcaa 


cgaggcagac 


gccgcggccc 


tcgagcacga 


caaggcctac 


1080 




gacaagcagc 


tcgagcaggg 


ggacaacccg 


tacctcaagt 


acaaccacgc 


cgacgccgag 


1140 


40 


tttcaggagc 


gtcttcaaga 


agatacgtct 


tttgggggca 


acctcgggcg 


agcagtcttc 


1200 




caggccaaga 


agcgggttct 


cgaacctctc 


ggtctggttg 


aggaaggcgc 


taagacggct 


1260 




cctggaaaga 


acfacf acccf crt 


agagccatca 


ccccagcgtt 


ctccagactc 


ctctacgggc 


1320 


45 


atcggcaaga 


caggccagca 


gcccgcgaaa 


aagagactca 


actttgggca 


gactggcgac 


1380 




tcagagtcag 


tgcccgaccc 


tcaaccaatc 


ggagaacccc 


ccgcaggccc 


ctctggtctg 


1440 




ggatctggta 


caatggctgc 


aggcggtggc 


gctccaatgg 


cagacaataa 


cgaaggcgcc 


1500 


50 


gacggagtgg 


gtagttcctc 


aggaaattgg 


cattgcgatt 


ccacatggct 


gggcgacaga 


1560 




gtcatcacca 


ccagcacccg 


aacctgggcc 


ctccccacct 


acaacaacca 


cctctacaag 


1620 




caaatctcca 


acgggacatc 


gggaggaagc 


accaacgaca 


acacctactt 


cggctacagc 


1680 


55 


accccctggg 


ggtattttga 


ctttaacaga 


ttccactgcc 


acttctcacc 


acgtgactgg 


1740 
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cagcgactca 


tcaacaacaa 


ctggggattc 


cggcccaaga 


gactcaactt 


caagctcttc 


1800 


aacatccagg 


tcaaggaggt 


cacgcagaat 


gaaggcacca 


agaccatcgc 


caataacctt 


1860 


accagcacga 


ttcaggtctt 


tacggactcg 


gaataccagc 


tcccgtacgt 


cctcggctct 


1920 


gcgcaccagg 


gctgcctgcc 


tccgttcccg 


gcggacgtct 


tcatgattcc 


tcagtacggg 


1980 


tacctgactc 


tgaacaacgg 


cagtcaggcc 


gtgggccgtt 


cctccttcta 


ctgcctggag 


2040 


tactttcctt 


ct caaatgct 


gagaacgggc 


aacaactttg 


agt tcagcta 


ccagtttgag 


2100 


gacgtgcctt 


ttcacagcag 


ctacgcgcac 


agccaaagcc 


tggaccggct 


gatgaacccc 


2160 


ctcatcgacc 


agtacctgta 


ctacctgtct 


cggactcagt 


ccacgggagg 


taccgcagga 


2220 


actcagcagt 


tgctattttc 


tcaggccggg 


cctaataaca 


tgtcggctca 


ggccaaaaac 


2280 


tggctacccg 


ggccctgcta 


ccggcagcaa 


cgcgtctcca 


cgacactgtc 


gcaaaataac 


2340 


aacagcaact 


ttgcttggac 


cggtgccacc 


aagtatcatc 


tgaatggcag 


agactctctg 


2400 


gtaaatcccg 


gtgtcgctat 


ggcaacgcac 


aaggacgacg 


aagagcgatt 


ttttccatcc 


2460 


agcggagtct 


tgatgtttgg 


gaaacaggga 


gctggaaaag 


acaacgtgga 


ctatagcagc 


2520 


gttatgctaa 


ccagtgagga 


agaaatcaaa 


accaccaacc 


cagtggccac 


agaacagtac 


2580 


ggcgtggtgg 


ccgataacct 


gcaacagcaa 


aacgccgctc 


ctattgtagg 


ggccgtcaac 


2640 


agtcaaggag 


ccttacctgg 


catggtctgg 


cagaaccggg 


acgtgtacct 


gcagggtcct 


2700 


atctgggcca 


agattcctca 


cacggacggc 


aactttcatc 


cttcgccgct 


gatgggaggc 


2760 


tttggactga 


aacacccgcc 


tcctcagatc 


ctgattaaga 


atacacctgt 


tcccgcggat 


2820 


cctccaacta 


ccttcagtca 


agccaagctg 


gcgtcgttca 


tcacgcagta 


cagcaccgga 


2880 


caggtcagcg 


tggaaattga 


atgggagctg 


cagaaagaga 


acagcaagcg 


ctggaaccca 


2940 


gagattcagt 


atacttccaa 


ctactacaaa 


tctacaaatg 


tggactttgc 


tgtcaatact 


3000 


gagggtactt 


attcagagcc 


tcgccccatt 


ggcacccgtt 


acctcacccg 


taacctgtaa 


3060 


ttgcctgtta 


atcaataaac 


cggttaattc 


gtttcagttg 


aactttggtc 


tctgcgaagg 


3120 


gcgaattcgt 


ttaaacctgc 


aggactagtc 


cctttagtga 


gggttaattc 


tgagcttggc 


3180 


gtaatcatgg 


gtcatag 










3197 



45 

<210> 30 

<211> 2501 

<212> DNA 

<213> new AAV serotype, clone 42.1b 

50 <400> 30 

gaattcgccc ttggctgcgt caactggacc aatgagaact ttcccttcaa cgattgcgtc 60 

gacaagatgg tgatctggtg ggaggagggc aagatgacgg ccaaggtcgt ggagtccgcc 120 

aaggccattc atcatctgct ggggcgggct cccgagattg cttgctcggc ctgcgatctg 180 

55 

gtcaacgtgg acctggatga ctgtgtttct gagcaataaa tgacttaaac caggtatggc 240 
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tgccgatggt 


tatcttccag 


attggctcga 


ggacaacctc 


tctgagggca 


ttcgcgagtg 


300 


gtgggacttg 


agacctggag 


ccccgaaacc 


caaagccaac 


cagcaaaagc 


aggacgacgg 


360 


ccggggtctg 


gtgcttcctg 


gctacaagta 


cctcggaccc 


ttcaacggac 


tcgacaaggg 


420 


agagccggtc 


aacgaggcag 


acgccgcggc 


cctcgagcac 


gacaaggcct 


acgacaagca 


480 


gctcgagcag 


ggggacaacc 


cgtacctcaa 


gtacaaccac 


gccgacgccg 


agtttcagga 


540 


gcgtcttcaa 


gaagatacgt 


cttttggggg 


caacctcggg 


cgagcagtct 


tccaggccaa 


600 


gaagcgggtt 


ctcgaacctc 


tcggtctggt 


tgaggaaggc 


gctaagacgg 


ctcctggaaa 


660 


gaagagaccc 


atagaatccc 


ccgactcctc 


cacgggcatc 


ggcaagaaag 


gccagcagcc 


720 


cgctaaaaag 


agactcaact 


ttgggcagac 


tggcgactca 


gagtcagtgc 


ccgaccctca 


780 


accaatcgga 


gaaccccccg 


caggcccctc 


tggtctggga 


tctggcacaa 


tggctgcagg 


840 


cggtggcgct 


ccaatggcag 


acaataacga 


aggcgccgac 


ggagtgggta 


gttcctcagg 


900 


aaattggcat 


tgcgat tcca 


catggctggg 


cgacagagtc 


atcaccacca 


gcacccgaac 


960 


ctgggccctc 


cccacctaca 


acaaccacct 


ctacaagcaa 


atctccaacg 


ggacatcggg 


1020 


aggaagcacc 


aacgacaaca 


cctacttcgg 


ctacagcacc 


ccctgggggt 


attttgactt 


1080 


taacagattc 


cactgccact 


tctcaccacg 


tgactggcag 


cgactcatca 


acaacaactg 


1140 


gggattccgg 


cccaagagac 


tcaactrcaa 


gctcttcaac 


atccaggtca 


aggaggtcac 


1200 


gcagaatgaa 


ggcaccaaga 


ccatcgccaa 


taaccttacc 


agcacgattc 


aggtctttac 


1260 


ggactcggaa 


taccagctcc 


cgtacgrcct 


cggctctgcg 


caccagggct 


gcctgcctcc 


1320 


gttcccggcg 


gacgtcttca 


tgattcctca 


gtacgggtac 


ctgactctga 


acaacggcag 


1380 


tcaggccgtg 


ggccgttcct 


ccttctactg 


cctggagtac 


tttccttctc 


aaatgctgag 


1440 


aacgggcaac 


aactttgagt 


tcagctacca 


gtttgaggac 


gtgccttttc 


acagcagcta 


1500 


tgcgcacagc 


caaagcctgg 


accggctgat 


gaaccccctc 


atcgaccagt 


acctgtacta 


1560 


cctgtctcgg 


actcagtcca 


cgggaggtac 


cgcaggaact 


cagcagttgc 


tattttctca 


1620 


ggccgggcct 


aataacatgt 


cggctcaggc 


caaaaactgg 


ctacccgggc 


cctgctaccg 


1680 


gcagcaacgc 


gtctccacga 


cagtgtcgca 


aaataacaac 


agcaactttg 


cttggaccgg 


1740 


tgccaccaag 


tatcatctga 


atggcagaga 


ctctctggta 


aatcccggtg 


tcgctatggc 


1800 


aacgcacaag 


ggcgacgaag 


agcgattttt 


tccatccagc 


ggagtcttga 


tgtttgggaa 


1860 


acagggagct 


ggaaaagaca 


acgtagacta 


tagcagcgtt 


atgctaacca 


gtgaggaaga 


1920 


aatcaaaacc 


accaacccag 


tggccacaga 


acagtacggc 


gtggtggccg 


ataacctgca 


1980 


acagcaaaac 


gccgctccta 


ttgtaggggc 


cgtcaacagt 


caaggagcct 


tacctggcat 


2040 


ggtctggcag 


aaccgggacg 


tgtacctgca 


gggtcctatc 


tgggccaaga 


ttcctcacac 


2100 


ggacggcaac 


tttcatcctt 


cgccgctgat 


gggaggcttt 


ggactgaaac 


acccgcctcc 


2160 
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tcagatcctg 


attaagaata 


cacctgttcc 


cgcggatcct 


ccaactacct 


tcagtcaagc 


2220 


caagctggcg 


tcgttcatca 


cgcagtacag 


caccggacag 


gtcagcgtgg 


aaattgaatg 


2280 


ggagctgcag 


aaagagaaca 


gcaagcgctg 


gaacccagag 


attcagtata 


cttccaacta 


2340 


ctacaaatct 


acaaatgtgg 


actttgctgt 


caatactgag 


ggtacttatt 


cagagcctcg 


2400 


ccccattggc 


acccgttacc 


tcacccgtaa 


cctgtaattg 


cctgttaatc 


aataaaccgg 


2460 


ttgattcgtt 


tcagttgaac 


tttggtctca 


agggcgaatt 


c 




2501 


<210> 31 
<211> 3113 
<212> DNA 

<213> new AAV serotype, clone 42.13 








<400> 31 
gaattcgccc 


tttctacggc 


tgcgtcaact 


ggaccaatga 


gaactttccc 


ttcaacgatt 


60 


gcgtcgacaa 


gatggtgatc 


tggtgggagg 


sigggcsLSLgeit 


gacggccaag 


gtcgtggagt 


120 


ccgccaaggc 


cattctcggc 


ggcagcaagg 


tgcgcgtgga 


ccaaaagtgc 


aagtcgtccg 


180 


cccagatcga 


tcccaccccc 


gtgatcgtca 


cttccaacac 


caacatgtgc 


gccgtgattg 


240 


acgggaacag 


caccaccttc 


gagcaccagc 


agccgttaca 


agaccggatg 


ttcaaatttg 


300 


aactcacccg 


ccgtctggag 


catgactttg 


gcaaggtgac 


aaagcaggaa 


gtcaaagagt 


36D 


tcttccgctg 


ggcgcaggat 


cacgtgaccg 


aggtggcgca 


tgagttctac 


gtcagaaagg 


423 


gtggagccaa 


caagagaccc 


gcccccgatg 


acgcggataa 


aagcgagccc 


aagcgggcct 


480 


gcccctcagt 


cgcggatcca 


tcgacgtcag 


acgcggaagg 


agctccggtg 


gactttgccg 


540 


acaggtacca 


aaacaaatgt 


t ctcgtcacg 


cgggcatgct 


tcagatgctg 


tttccctgca 


600 


agacatgcga 


gagaatgaat 


cagaatttca 


acatttgctt 


cacgcacggg 


accagagact 


660 


gttcagaatg 


tttccccggc 


gtgtcagaat 


ctcaaccggt 


cgtcagaaag 


aggacgtatc 


720 


ggaaactctg 


tgccattcat 


catctgctgg 


ggcgggctcc 


cgagattgct 


tgctcggcct 


780 


gcgatctggt 


caacgtggac 


ctggatgact 


gtgtttctga 


gcaataaatg 


acttaaacca 


840 


ggtatggctg 


ccgatggtta 


tcttccagat 


tggctcgagg 


acaacctctc 


tgagggcatt 


900 


cgcgagtggt 


gggacttgaa 


acctggagcc 


ccgaaaccca 


aagccaacca 


gcaaaagcag 


960 


gacgacggcc 


ggggtctggt 


gcttcctggc 


tacaagtacc 


tcggaccctt 


caacggactc 


1020 


gacaaggggg 


agcccgtcaa 


cgcggcggac 


gcagcggccc 


tcgagcacga 


caaggcctac 


1080 


gaccagcagc 


tcaaagcggg 


tgacaatccg 


tacctgcggt 


ataaccacgc 


cgacgccgag 


1140 


tttcaggagc 


gtcttcaaga 


agatacgtct 


tttgggggca 


acctcgggcg 


agcagtcttc 


1200 


caggccaaga 


agcgggttct 


cgaacctctc 


ggtctggttg 


aggaaggcgc 


taagacggct 


1260 


cctggaaaga 


agagacccat 


agaatccccc 


gactcctcca 


cgggcatcgg 


caagaaaggc 


1320 


cagcagcccg 


ctaaaaagaa 


gctcaacttt 


gggcagactg 


gcgactcaga 


gtcagtgccc 


1380 
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gaccctcaac 


caatcggaga 


accccccgca 


ggcccctctg 


gtctgggatc 


tggtacaatg 


1440 


gctgcaggcg 


gtggcgctcc 


aatggcagac 


aataacgaag 


gcgccgacgg 


agtgggtagt 


1500 


tcctcaggaa 


attggcattg 


cgattccaca 


tggctgggcg 


acagagtcat 


caccaccagc 


1560 


acccgaacct 


gggccctccc 


cacctacaac 


aaccacctct 


acaagcaaat 


ctccaacggg 


1620 


acatcgggag 


gaagcaccaa 


cgacaacacc 


tacttcggct 


acagcacccc 


ctgggggtat 


1680 


tttgacttta 


acagattcca 


ctgccacttc 


tcaccacgtg 


actggcagcg 


actcatcaac 


1740 


aacaactggg 


gattccggcc 


caagagactc 


aacttcaagc 


tcttcaacat 


ccaggtcaag 


18C0 


gaggtcacgc 


agaatgaagg 


caccaagacc 


at cgccaata 


acctt accag 


cacgattcag 


1860 


gtctttacgg 


actcggaata 


ccagctcccg 


tacgtcctcg 


gctctgcgca 


ccagggctgc 


1920 


ctgcctccgt 


tcccggcgga 


cgtcttcatg 


attcctcagt 


acgggtacct 


gactctgaac 


1980 


aacggcagtc 


aggccgtggg 


ccgttcctcc 


ttctactgcc 


tggagtactt 


tcct tctcaa 


2040 


atgctgagaa 


cgggcaacaa 


ctttgagttc 


agctaccagt 


ttgaggacgt 


gccttttcac 


2100 


agcagctatg 


cgcacagcca 


aagcctggac 


cggctgatga 


accccctcat 


cgaccagtac 


2160 


ctgtactacc 


tgtctcggac 


tcagtccacg 


ggaggtaccg 


caggaactca 


gcagttgcta 


2220 


ttttctcagg 


ccgggcctaa 


taacatgtcg 


gctcaggcca 


aaaactggct 


acccgggccc 


2280 


tgctaccggc 


agcaacgcgt 


ctccacgaca 


gtgtcgcaaa 


ataacaacag 


caactttgct 


2340 


tggaccggtg 


ccaccaagta 


tcatctgaat 


ggcagagact 


ctctggtaaa 


tcccggtgtc 


2400 


gctatggcaa 


cgcacaaggg 


cgacgaagag 


cgattttttc 


catccagcgg 


agtcttgatg 


2460 


tttgggaaac 


agggagctgg 


aaaagacaac 


gtggactata 


gcagcgttat 


gctaaccagt 


2520 


gaggaagaaa 


tcaaaaccac 


caacccagtg 


gccacagaac 


agtacggcgt 


ggtggccgat 


2580 


aacctgcaac 


agcaaaacgc 


cgctcctatt 


gtaggggccg 


tcaacagtca 


aggagcctta 


2640 


cctggcatgg 


tctggcagaa 


ccgggacgtg 


tacctgcagg 


gtcctatctg 


ggccaagatt 


2700 


cctcacacgg 


acggcaactt 


tcatccttcg 


ccgctgatgg 


gaggctttgg 


actgaaacac 


2760 


ccgcctcctc 


agatcctgat 


taagaataca 


cctgttcccg 


cggatcctcc 


aactaccttc 


2820 


agtcaagcca 


agctggcgtc 


gttcatcacg 


cagtacagca 


ccggacaggt 


cagcgtggaa 


2880 


attgaatggg 


agctgcagaa 


agagaacagc 


aagcgc tgga 


acccagagat 


tcagtatact 


2940 


tccaactact 


acaaatctac 


aaatgtggac 


tttgctgtca 


atactgaggg 


tacttattca 


3000 


gagcctcgcc 


ccattggcac 


ccgttacctc 


acccgt agcc 


tgtaattgcc 


tgttaatcaa 


3060 


taaaccggtt 


gattcgtttc 


agttgaactt 


tggtctctgc 


gaagggcgaa 


ttc 


3113 



50 
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<400> 32 





gaattcgccc 


tttctacggc 


tgcgtcaact 


ggaccaatga 


gaactttccc 


ttcaacgatt 


60 


5 


g'cg'tccfacaa. 


gatggtgatc 


tggtgggagg 


agggcaagat 


gacggccaag 


gtcgtggagt 


12 0 




ccg'cca.acfgc 


cattctcggc 


ggcagcaagg 


tgcgcgtgga 


ccaaaagtgc 


aagtcgtccg 


180 




cccacfaticga. 


tcccaccccc 


gtgatcgtca 


cttccaacac 


caacatgtgc 


gccgtgattg 


240 


10 




caccacc't'tc 


gagcaccagc 


agccgttaca 


agaccggatg 


ttcaaatttg 


300 




A a r** a ^ /—I /-« 


ccg'tc't ggag 


catgactttg 


gcaaggtgac 


aaagcaggaa 


gtcaaagagt 


350 




t cttccgctg 


ggcgcaggat 


cacgtgaccg 


aggtggcgca 


tgagttctac 


gtcagaaagg 


420 


15 


g tig gage caa 


caagagaccc 


gcccccgatg 


acgcggataa 


aagcgagccc 


aagcgggcct 


480 




gcccctcagt 


cgcggatcca 


tcgacgtcag 


acgcggaagg 


agctccggtg 


gactttgccg 


540 




acaggtacca 


aaacaaatgt 


tctcgtcacg 


cgggcatgct 


tcagatgctg 


cttccctgca 


600 


20 


agaca'tgcga 


gaga at: ga at: 


cagaatttca 


gcattt gctt 


cacgcacggg 


accagagact 


660 




gtt cagaatg 


t;t:tccccggc 


gtgtcagaat 


ctcaaccggt 


cgtcagaaag 


aggacgtatc 


720 




ggaaactctg 


tgc cat teat 


catctgctgg 


ggcgggctcc 


cgagattgct 


tgctcggcct 


780 


25 


1^ ^ u 4^ 4~ 4— 

gcga uc tgg t 


caacgtggac 


ctggatgact 


gtgtttctga 


gcaataaatg 


acttaaacca 


840 




ggtatggctg 


ccgatggtca 


tcttccagat 


tggctcgagg 


acaacctctc 


tgagggcatt 


900 




cgcgagtggt 


gggact tgaa 


acctggagcc 


ccgaaaccca 


aagccaacca 


gcaaaagcag 


960 


30 




ggggtc tggt 


gcttcctggc 


tacaagtacc 


tcggaccctt 


caacggactc 


1020 




gacaaggggg 


agcccgtcaa 


cgcggcggac 


gcagcggccc 


tcgagcacga 


caaggcctac 


1080 




/V SI /X 1^ a r*r 


t c aaagcg gg 


tgacaatccg 


tacctgcggt 


ataaccacgc 


cgacgccgag 


1140 


35 


tttcaggagc 


gt ctt:caaga 


agatacgtct 


tttgggggca 


acctcgggcg 


agcagtcttc 


1200 




^ /y A ^ ^ 


agcgggtt ct 


cgaacctct c 


ggtctggttg 


aggaaggcgc 


taagacggct 


1260 






agagacccat 


agaatccccc 


gactcctcca 


cgggcatcgg 


caagaaaggc 


1320 


40 


r* A rt Si fTf* t— r* n 


ct aaaaagaa 


get caactt t 


gggcagactg 


gcgactcaga 


gtcagtgccc 


1380 




y d ^ «w U-L- ctctv^ 


a. a. u u g gay a 


accccccgca 


ggcccctctg 


gtctgggatc 


tggtacaatg 


1440 


45 




gtggcgctcc 


aatggcagac 


aataacgaag 


gcgccgacgg 


agtgggtagt 


1500 


"tcctcaggaa 


at tggcattg 


cgattccaca 


tagctgggcg 


acagagtcat 


caccaccagc 


1560 




acccgaacct 


gggccc tccc 


cacctacaac 


aaccacctct 


acaagcaaat 


ctccaacggg 


1620 


50 


acatcgggag 


gaagcaccaa 


cgacaacacc 


tacttcggct 


acagcacccc 


ctgggggtat 


1680 


tttgacttta 


acagattcca 


ctgccacttc 


tcaccacgtg 


actggcagcg 


actcatcaac 


1740 




aacagctggg 


gattccggcc 


caagagactc 


aacttcaagc 


tcttcaacat 


ccaggtcaag 


1800 


55 


gaggtcacgc 


agaatgaagg 


caccaagacc 


atcgccaata 


accttaccag 


cacgattcag 


18 60 


gtctttacgg 


actcggaata 


ccagctcccg 


tacgtcctcg 


gctctgcgca 


ccagggctgc 


1920 
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ctgcctccgt 


tcccggcgga 


cgtcttcatg 


attcctcagt 


acgggtacct 


gactctgaac 


1980 


aacggcagtc 


aggccgtggg 


ccgttcctcc 


ttctactgcc 


tggagtactt 


tccttctcaa 


2040 


atgctgagaa 


cgggcaacaa 


ctttgagttc 


agctaccagt 


ttgaggacgt 


gccttttcac 


2100 


agcagctacg 


cgcacagcca 


aagcctggac 


cggctgatga 


accccctcat 


cgaccagtac 


2160 


ctgtactacc 


tgtctcggac 


tcagtccacg 


ggaggtaccg 


caggaactca 


gcagttgcta 


2220 


ttttctcagg 


ccgggcctaa 


taacatgtcg 


gctcaggcca 


aaaactggct 


acccgggccc 


2280 


tgctaccggc 


agcaacgcgt 


ctccacgaca 


ctgtcgcaaa 


ataacaacag 


caactttgct 


2340 


tggaccggtg 


ccaccaagta 


tcatctgaat 


ggcagagact 


ctctggtaaa 


tcccggtgtc 


24C0 


gctatggcaa 


cgcacaagga 


cgacgaagag 


cgattttttc 


catccagcgg 


agtcttgatg 


2460 


tttgggaaac 


agggagctgg 


aaaagacaac 


gtggactata 


gcagcgttat 


gctaaccagt 


2520 


gaggaagaaa 


tcaaaaccac 


caacccagtg 


gccacagaac 


agtacggcgt 


ggtggccgat 


2580 


aacctgcaac 


agcaaaacgc 


cgctcctatt 


gtaggggccg 


tcaacagtca 


aggagcctta 


2640 


cctggcatgg 


tctggcagaa 


ccgggacgtg 


tacctgcagg 


gtcctatctg 


ggccaagatt 


2700 


cctcacacgg 


acggcaactt 


tcatccttcg 


ccgctgatgg 


gaggctttgg 


actgaaacac 


2760 


ccgcctcctc 


agatcctgat 


taagaataca 


cctgttcccg 


cggatcctcc 


aactaccttc 


2820 


agtcaagcca 


agctggcgtc 


gttcatcacg 


cagtacagca 


ccggacaggt 


cagcgtggaa 


2880 


attgaatggg 


agctgcagaa 


agagaacagc 


aagcgctgga 


acccagagat 


tcagtatact 


2940 


tccaactact 


acaaatctac 


aaatgtggac 


tttgctgtca 


atactgaggg 


tacttattca 


3000 


gagcctcgcc 


ccattggcac 


ccgttacctc 


acccgtaacc 


tgtaattgcc 


tgttaatcaa 


3060 


taaaccggtt 


aattcgtttc 


agttgaactt 


tggtctctgc 


gaagggcgaa 


ttc 


3113 


<210> 33 

•v./i_L_L^ J U *: 

<212> DNA 
^^X-J-^ new 


[ 

AAV serotype, clone 42 










<4UU> oo 
gaattcgccc 


tttctacggc 


tgcgtcaact 


ggaccaatga 


gaactttccc 


ttcaacgatt 


60 


gcgtcgacaa 


gatggtgatc 


tggtgggagg 


agggcaagat 


gacggccaag 


gtcgtggagt 


120 


ccgccaaggc 


cattcatcat 


ctgctggggc 


gggctcccga 


gattgcttgc 


tcggcctgcg 


180 


atctggtcaa 


cgtggacctg 


gatgactgtg 


tttctgagca 


ataaatgact 


taaaccaggt 


240 


atggctgccg 


atggttatct 


tccagattgg 


ctcgaggaca 


acctctctga 


gggcattcgc 


300 


gagtggtggg 


acttgaaacc 


tggagccccg 


aaacccaaag 


ccaaccagca 


aaagcaggac 


360 


gacggccggg 


gtctggtgct 


tcctggctac 


aagtacctcg 


gacccttcaa 


cggactcgac 


420 


aagggagagc 


cggtcaacga 


ggcagacgcc 


gcggccctcg 


agcacgacaa 


ggcctacgac 


480 


aagcagctcg 


agcaggggga 


caacccgtac 


ctcaagtaca 


accacgccga 


cgccgagttt 


540 
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caggagcgtc 


ttcaagaaga 


tacgtctttt 


gggggcaacc 


tcgggcgagc 


agtcttccag 


600 


gccaagaagc 


gggttctcga 


acctctcggt 


ctggttgagg 


aaggcgctaa 


gacggctcct 


660 


ggaaagaaga 


gacccataga 


atcccccgac 


tcctccacgg 


gcatcggcaa 


gaaaggccag 


720 


cagcccgcta 


aaaagaagct 


caactttggg 


cagactggcg 


actcagagtc 


agtgcccgac 


780 


cctcaaccaa 


tcggagaacc 


ccccgcaggc 


ccctctggtc 


tgggatctgg 


tacaatggct 


840 


gcaggcggtg 


gcgctccaat 


ggcagacaat 


aacgaaggcg 


ccgacggagt 


gggtaatgcc 


900 


tccggaaatt 


ggcattgcga 


ttccacatgg 


ctgggcgaca 


gagtcatcac 


caccagcacc 


960 


cgcacctggg 


ccctgcccac 


ctacaacaac 


cacctctaca 


agcagatatc 


aagtcagagc 


1020 


ggggctacca 


acgacaacca 


cttcttcggc 


tacagcaccc 


cctggggcta 


ttttgacttc 


1080 


aacagattcc 


actgccactt 


ctcatcacgt 


gactggcagc 


gact catcaa 


caacaactgg 


1140 


ggattccggc 


ccaagagact 


caacttcaag 


ctcttcaaca 


tccaggtcaa 


ggaggtcacg 


1200 


cagaatgaag 


gcaccaagac 


catcgccaat 


aaccttacca 


gcacgattca 


ggtctttacg 


1260 


gactcggaat 


accggctccc 


gtacgtcctc 


ggctctgcgc 


accagggctg 


cctgcctccg 


1320 


ttcccggcgg 


acgtcttcat 


gattcctcag 


tacgggtacc 


tgactctgaa 


caacggcagt 


1380 


caggccgtgg 


gccgttcctc 


cttctactgc 


ctggagtact 


ttccttctca 


aatgctgaga 


1440 


acgggcaaca 


actttgagtt 


cagctaccag 


tttgaggacg 


tgccttttca 


cagcagctac 


15C0 


gcgcacagcc 


aaagcctgga 


ccggctgatg 


aaccccctca 


tcgaccagta 


cctgtactac 


1560 


ctgtctcgga 


ctcagtccac 


gggaggtacc 


gcaggaactc 


agcagttgct 


attttctcag 


1620 


gccgggccta 


ataacatgtc 


ggctcaggcc 


aaaaactggc 


tacccgggcc 


ctgctaccgg 


1680 


cagcaacgcg 


tctccacgac 


actgtcgcaa 


aataacaaca 


gcaactttgc 


ttggaccggt 


1740 


gccaccaagt 


atcatctgaa 


tggcagagac 


tctctggtaa 


atcccggtgt 


cgctatggca 


1800 


acgcacaagg 


acgacgaaga 


gcgatttttt 


ccatccagcg 


gagtcttgat 


gtttgggaaa 


1860 


cagggagctg 


gaaaagacaa 


cgtggactat 


agcagcgtta 


tgctaaccag 


tgaggaagaa 


1920 


atcaaaacca 


ccaacccagt 


ggccacagaa 


cagtacggcg 


tggtggccga 


taacctgcaa 


1980 


cagcaaaacg 


ccgctcctat 


tgtaggggcc 


gtcaacagtc 


aaggagcctt 


acctggcatg 


2040 


gtctggcaga 


accgggacgt 


gtacctgcag 


ggtcctatct 


gggccaagat 


tcctcacacg 


2100 


gacggcaact 


ttcatccttc 


gccgctgatg 


ggaggctttg 


gactgaaaca 


cccgcctcct 


2160 


cagatcctga 


ttaagaatac 


acctgttccc 


gcggatcctc 


caactacctt 


cagtcaagcc 


2220 


aagccggcgt 


cgttcatcac 


gcagtacagc 


accggacagg 


tcagcgtgga 


aattgaatgg 


2280 


gagctgcaga 


aagagaacag 


caagcgctgg 


aacccagaga 


ttcagtatac 


ttccaactac 


2340 


tacaaatcta 


caaatgtgga 


ctttgctgtc 


aatactgagg 


gtacttattc 


agagcctcgc 


2400 


cccattggca 


cccgttacct 


cacccgtaac 


ctgtaattgc 


ctgttaatca 


ataaaccggt 


2460 
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<210> 34 

<211> 3106 

<212> DNA 

<213> new AA.V serotype, clone 42.5a 

<400> 34 



10 


gaattcgccc 


ttctacggct 


gcgtcaactg 


gaccaatgag 


aactttccct 


tcaacgattg 


60 




cgtcgacaag 


atggtgatct 


ggtgggagga 


gggcaagatg 


acggccaagg 


tcgtggagtc 


12 0 




cgccaaggcc 


at tctcggcg 


gcagcaaggt 


gcgcgtggac 


caaaagt: gca 


agtcgtccgc 


ICO 


15 


ccagatcgac 


cccacccccg 


t gatcgtcac 


ctccaacacc 


aacat gtgcg 


ccgt gattga 


240 




cgggaacagc 


accacct:t:cg 


agcaccagca 


gccgttgcag 


gaccggatigt 


tcaaatttga 


3C0 




actcacccgc 


cgtctggagc 


atgactttgg 


caaggcgaca 


aagcaggaag 


tcaaagagtt 


360 


20 


cttccgc tgg 


gcgcaggatc 


acgtgaccga 


ggtggcgcat 


gagttctacg 


tcagaaaggg 


420 




tggagccaac 


aagagacccg 


cccccgatga 


cgcggataaa 


agcgagccca 


agcgggcccg 


4o U 




cccctcagtc 


gcggatccat 


cgacgtcaga 


cgcggaagga 


gctccggtgg 


actt tgccga 


540 


25 


caggt accaa 


aacaaatgtt 


c tcgt cacgc 


gggcatgc tt 


cagat gc t gt 


ttccctgcaa 


oU U 




aacatgcgag 


agaatgaatc 


agaat ttcaa 


catttgcttc 


acgcacggga 


cc agagactg 


c cr\ 




ttcagaatgt 


ttccccggcg 


tgtcagaat c 


"tcaaccggtc 


gtcagaaaga 


ggac gtatcg 




30 






SI 'f~ 4* "t* ^ rf 

aut-uyu Lyyy 


rT \~ 

yoyyyu ut^oc^ 


^ rt "f" 4" 

ycaydUL-yo^i-L 


r< r"* "^T" r* rr f~* 

yo uuyyucuy 


7ft n 
/ o u 




cgatctggtc 


aacgt ggacc 


tggatg^actg 


"tgtttctgag 


caataaatga 


cttaaaccag 


Q /I n 
o ft U 






Cy auyy utac 


cuucco-yaLt 


gg c tcgagga 


caacctctct 


gagggcattc 


Q n n 
y U U 


35 


#^ a /"^ +- ^ /nr 4- #^ 

gcgagtggug 


gy ac u ugaaa 


cctggagccc 


cgaaacccaa 


agccaa.cca.g 


caaaagcagg 


Q n 
y c U 




a.oyciv_yy (_c^y 


yyy uuuyy uy 


t^LLCUUyyCU 


aocxo.y Ua.OCu 


^ "t" "t* 

oyyaccc u 


^a,v^yga.L. ut^y 


J. u 




d^ciCiy yy ^ 


y u> oy y u t^o.^^ 


yayy^o.yoiL«y 


^ ^ 

v^i^yQ^yyc^OL^u 


Si ^ ^ A f* 

(.•yayoat^yav^ 


ao.yycQoU^L'y 


1 n ft fi 

_L U O U 


40 


acaagcagct 


cgagcagggg 


gacaacccgtz 


accCcaagtza 


caaccacgcc 


gacgccgagt 


1 T /I A 
-LXft U 




ttcaggagcg 


tcttcaagaa 


gatacgtctt 


ttgggggcaa 


cctcgggcga 


gcagtcttcc 


1200 




gggccaagaa 


gcgggttctc 


gaacctctcg 


gtctggttga 


ggaaggcgct 


aagacggctc 


12 60 


45 


ctggaaagaa 


gagacccata 


gaatcccccg 


actcctccac 


gggcatcggc 


aagaaaggcc 


1320 




agcagcccgc 


taaaaagaag 


ctcaactttg 


ggcagactgg 


cgactcagag 


tcagtgcccg 


1380 


50 


acccccaacc 


tctcggagaa 


cctcccgccg 


cgccctcagg 


tctgggatct 


ggtacaatgg 


1440 


ctgcaggcgg 


tggcgcacca 


atggcagaca 


ataacgaagg 


cgccgacgga 


gtgggtaatg 


1500 




cctccggaaa 


ttggcattgc 


gattccacat 


ggctgggcga 


cagagtcatc 


accaccagca 


1560 


55 


cccgcacctg 


ggccctgccc 


acctacaaca 


accacctcta 


caagcagata 


tcaagtcaga 


1620 


gcggggctac 


caacgacaac 


cacttcttcg 


gctacagcac 


cccctggggc 


tattttgact 


1680 
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tcaacagatt ccactgccac ttctcaccac gtgactggca gcgactcatc aacaacaacc 1740 

ggggattccg gcccagaaag ctgcggttca agttgttcaa catccaggtc aaggaggtca 1800 

cgacgaacga cggcgttacg accatcgcta ataaccttac cagcacgatt caggtcttct 1860 

cggactcgga gtaccaactg ccgtacgtcc tcggctctgc gcaccagggc tgcctccctc 1920 

cgttccctgc ggacgtgttc atgattcctc agtacggata tctgactcta aacaacggca 1960 

gtcagtctgt gggacgttcc tccttctact gcctggagta ctttccttct cagatgctga 2040 

gaacgggcaa taactttgaa ttcagctacc agtttgagga cgtgcccttt cacagcagct 2100 

acgcgcacag ccaaagcctg gaccggctga tgaaccccct catcgaccag tacctgtact 2150 

acctgtctcg gactcagtcc acgggaggta ccgcaggaac tcagcagttg ctattttctc 2220 

aggccgggcc taataacatg tcggctcagg ccaaaaactg gctacccggg ccctgctacc 2280 

ggcagcaacg cgtctccacg acactgtcgc aaaataacaa cagcaacttt gcttggaccg 2340 

gtgccaccaa gtatcatctg aatggcagag actctctggt aaatcccggt gtcgctatgg 2400 

caacgcacaa ggacgacgaa gagcgatttt ttccatccag cggagtcttg atgtttggga 2 460 

aacagggagc tggaaaagac aacgtggact atagcagcgt tatgctaacc agtgaggaag 2520 

aaatcaaaac caccaaccca gtggccacag aacagtacgg cgtggtggcc gataacctgc 2580 

aacagcaaaa cgccgctcct attgtagggg ccgtcaacag tcaaggagcc ttacctggca 2 640 

tggcctggca gaaccgggac gtgtacctgc agggtcctat ctgggccaag attcctcaca 27 00 

cggacggcaa ctttcatcct tcgccgctga tgggaggctt tggactgaaa cacccgcctc 2760 

ctcagatcct gattaagaat acacctgttc ccgcggatcc tccaactacc ttcagtcaag 2 820 

ccaagctggc gtcgttcatc acgcagtaca gcaccggaca ggtcagcgtg gaaattgaat 2880 

gggagctgca gaaagagaac agcaagcgct ggaacccaga gattcagtat acttccaact 2 940 

actacaaatc tacaaatgtg gactttgctg tcaatactga gggtacttat tcagagcctc 3000 

gccccattgg cacccgttac ctcacccgta acctgtaatt gcctgttaat caataaaccg 3060 

gttaattcgt ttcagttgaa ctttggtctc tgcgaagggc gaattc 3106 

<210> 35 
<211> 2489 
<212> DNA 

<213> new AAV serotype, clone 42,10 
<400> 35 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 60 

gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtgaagt 120 

ccgccaaggc cattcatcat ctgctggggc gggctcccga gattgcttgc tcggcctgcg 180 

atctggtcaa cgtggacctg gatgactgtg tttctgagca ataaatgact taaaccaggt 240 

atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 300 
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gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac 360 

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 420 

aagggagagc cggtcaacga ggcagacgcc gcggccctcg agcacgacaa ggcctacgac 48 0 

aagcagctcg agcaggggga caacccgtac ctcaagtaca accacgccga cgccgagttt 540 

caggagcgtc ttcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 600 

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct 660 

ggaaagaaga gacccataga atcccccgac tcctccacgg gcatcggcag gaaaggccag 720 

cagcccgcta aaaagaagct caactttggg cagactggcg actcagagtc agtgcccgac 780 

cctcaaccaa tcggagaacc ccccgcaggc ccctctggtc tgggatctgg tacaatggct 840 

gcaggcggtg gcgctccaat ggcagacaat aacgaaggcg ccgacggagt gggtaatgcc 900 

tccggaaatt ggcattgcga ttccacatgg ctgggcgaca gagtcatcac caccagcacc 960 

cgcacctggg ccctgcccac ctacaacaac cacctctaca agcagatatc aagtcagagc 1020 

ggggctacca acgacaacca cttcttcggc tacagcaccc cctggggcta ttttgacttc 1080 

aacagattcc actgccactt ctcaccacgt gactggcagc gactcatcaa caacaactgg 1140 

ggattccggc ccagaaagct gcggttcaag ttgttcaaca tccaggtcaa ggaggtcacg 1200 

acgaacgacg gcgttacgac catcgccaat aaccttacca gcacgattca ggtcttctcg 1260 

gactcggagt accaactgcc gtacgtcctc ggctctgcgc accagggctg cctccctccg 1320 

ttccctgcgg acgtgttcat gattcctcag tacggatatc tgactctaaa caacggcagt 1380 

cagtctgtgg gacgttcctc cttctactgc ctggagtact ttccttctca gatgctgaga 1440 

acgggcaata actttgaatt cagctacacc tttgaggaag tgcctttcca cagcagctat 1500 

gcgcacagcc agagcctgga ccggctgatg aatcccctca tcgaccagta cctgtactac 1560 

ctggcccgga cccagagcac tacggggtcc acaagggagc tgcagttcca tcaggctggg 1620 

cccaacacca tggccgagca atcaaagaac tggctgcccg gaccctgtta tcggcagcag 1680 

agactgtcaa aaaacataga cagcaacaac aacagtaact ttgcctggac cggggccact 1740 

aaataccatc tgaatggtag aaattcatta accaacccgg gcgtagccat ggccaccaac 1800 

aaggacgacg aggaccagtt ctttcccatc aacggagtgc tggtttttgg caaaacgggg I860 

gctgccaaca agacaacgct ggaaaacgtg ctaatgacca gcgaggagga gatcaaaacc 1920 

accaatcccg tggctacaga agaatacggt gtggtctcca gcaacctgca atcgtctacg 1980 

gccggacccc agacacagac tgtcaacagc cagggggctc tgcccggcat ggtctggcag 2040 

aaccgggacg tgtacctgca gggtcccatc tgggccaaaa ttcctcacac ggacggcaac 2100 

tttcacccgt ctcccctgat gggcggattt ggactcaaac acccgcctcc tcaaattctc 2160 

atcaaaaaca ccccggtacc tgctaatcct ccagaggtgt ttactcctgc caagtttgcc 2220 
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tcatttatca 


cgcagtacag 


caccggccag 


gtcagcgtgg 


agatcgagtg 


ggaactgcag 


2280 


aaagaaaaca 


gcaaacgctg 


gaatccagag 


attcagtaca 


cctcaaatta 


tgccaagtct 


2340 


aataatgtgg 


aatttgctgt 


caacaacgaa 


ggggtttata 


ctgagcctcg 


ccccattggc 


2400 


acccgttacc 


tcacccgtaa 


cctgtaattg 


cctgttaatc 


aataaaccgg 


ttaattcgtt 


2460 


tcagttgaac 


tttggtcaag 


ggcgaattc 








2489 


<210> 36 
<211> 2495 

<212> DNA 

<213> new AAV serotype, clone 42.3b 








<400> 36 
gaattcgccc 


tttctacggc 


tgcgtcaact 


agaccaatga 


gaactttccc 


ttcaacgatt 


60 


gcgtcgacaa 


gatggtgatc 


tggtgggagg 


agggcaagat 


gacggccaag 


gtcgtggagt 


12 0 


ccgccaaggc 


cattcatcat 


ctgctggggc 


gggctcccga 


gattgcttgc 


tcggcctgcg 


180 


atctggtcaa 


cgtggacctg 


gatgactgtg 


tttctgagca 


ataaatgact 


taaaccaggt 


240 


atggctgccg 


atggttatct 


tccagattgg 


ctcgaggaca 


acctctctga 


gggcattcgc 


300 


gagtggtggg 


acttgaaacc 


tggagccccg 


aaacccaaag 


ccaaccagca 


aaagcaggac 


360 


gacggccggg 


gtctggtgct 


tcctggctac 


aagtacctcg 


gacccttcaa 


cggactcgac 


420 


aagggagagc 


cggtcaacga 


ggcagacgcc 


gcggccctcg 


agcacgacaa 


ggcctacgac 


480 


aagcagctcg 


agcaggggga 


caacccgtac 


ctcaagtaca 


accacgccga 


cgccgagttt 


540 


caggagcgtc 


ttcaagaaga 


tacgtctttt 


gggggcaacc 


tcgggcgagc 


agtcttccag 


600 


gccaagaagc 


gggttctcga 


acctctcggt 


ctggttgagg 


aaggcgctaa 


gacggctcct 


660 


ggaaagaaga 


gacccataga 


atcccccgac 


tcctccacgg 


gcatcggcaa 


gaaaggccag 


720 


cagcccgcta 


aaaagaagct 


caactttggg 


cagactggcg 


actcagagtc 


agtgcccgac 


780 


cctcaaccaa 


tcggagaacc 


ccccgcaggc 


ccctctggtc 


tgggatctgg 


tacaatggct 


840 


gcaggcggtg 


gcgctccaat 


ggcagacaat 


aacgaaggcg 


ccgacggagt 


gggtaatgcc 


900 


tccggaaatt 


ggcattgcga 


ttccacatgg 


ctgggcgaca 


gagtcatcac 


caccagcacc 


960 


cgcacctggg 


ccctgcccac 


ctacaacaac 


cacctctaca 


agcagatatc 


aagtcagagc 


1020 


ggggctacca 


acgacaacca 


cttcttcggc 


tacagcaccc 


cctggggcta 


ttttgacttc 


1080 


aacagattcc 


actgccactt 


ctcaccacgt 


gactggcagc 


gactcatcaa 


caacaactgg 


1140 


ggattccggc 


ccagaaagct 


gcggttcaag 


ttgttcaaca 


tccaggtcaa 


ggaggtcacg 


1200 


acgaacgacg 


gcgttacgac 


catcgctaat 


aaccttacca 


gcacgattca 


ggtcttctcg 


1260 


gactcggagt 


accaactgcc 


gtacgtcctc 


ggctctgcgc 


accagggctg 


cctccctccg 


1320 


ttccctgcgg 


acgtgttcat 


gattcctcag 


tacggatatc 


tgactctaaa 


caacggcagt 


1380 


cagtctgtgg 


gacgttcctc 


cttctactgc 


ctggagtact 


ttccttctca 


gatgctgaga 


1440 
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acgggcaata actttgaatt cagctacacc tttgaggaag tgcctttcca cagcagctat 1500 

gcgcacagcc agagcctgga ccggctgatg aatcccctca tcgaccagta cctgtactac 1560 

ctggcccgga cccagagcac tacggggtcc acaagggagc tgcagttcca tcaggctggg 1620 

cccaacacca tggccgagca atcaaagaac tggctgcccg gaccctgtta tcggcagcag 1680 

agactgtcaa aaaacataga cagcaacaac accagtaact ttgcctggac cggggccact 1740 

aaataccatc tgaatggtag aaattcatta accaacccgg gcgtagccat ggccaccaac 1800 

aaggacgacg aggaccagtt ctttcccatc aacggagtgc tggtttttgg caaaacgggg 1850 

gctgccaaca agacaacgct ggaaaacgtg ctaatgacca gcgaggagga gatcaaaacc 1920 

accaatcccg tggctacaga acagtacggt gtggtctcca gcaacctgca atcgtctacg 1980 

gccggacccc agacacagac tgtcaacagc cagggggctc tgcccggcat ggtctggcag 2040 

aaccgggacg tgtacctgca gggtcccatc tgggccaaaa ttcctcacac ggacggcaac 2100 

tttcacccgt ctcccctgat gggcggattt ggactcaaac acccgcctcc tcaaattctc 2160 

atcaaaaaca ccccggtacc tgctaatcct ccagaggtgt ttactcctgc caagtttgcc 2220 

tcatttatca cgcagtacag caccggccag gtcagcgtgg agatcgagtg ggaactgcag 228 0 

aaagaaaaca gcaaacgctg gaatccagag attcagtaca cctcaaatta tgccaagtct 2340 

aataatgtgg aatttgctgt caacaacgaa ggggtttata ctgagcctcg ccccattggc 2400 

acccgttacc tcacccgtaa cctgtaattg cctgttaatc aataaaccgg ttaattcgtt 2460 

tcagttgaac tttggtctct gcgaagggcg aattc 2495 



<210> 37 

<211> 3098 

<212> DNA 

<213> new AAV serotype, clone 42.11 

<400> 37 



40 



55 



gaattcgccc 


tttctacggc 


tgcgtcaact 


ggaccaatga 


gaactttccc 


ttcaacgatt 


60 


gcgtcgacaa 


gatggtgatc 


tggtgggagg 


agggcaagat 


gacggccaag 


gtcgtggagt 


120 


ccgccaaggc 


cattctcggc 


ggcagcaagg 


tgcgcgtgga 


ccaaaagtgc 


aagtcttccg 


180 


cccagatcga 


tcccaccccc 


gtgatcgtca 


cttccaacac 


caacatgtgc 


gccgtgattg 


240 


acgggaacag 


caccaccttc 


gagcaccagc 


agccgttaca 


agaccggatg 


ttcaaatttg 


300 


aactcacccg 


ccgtctggag 


cacgactttg 


gcaaggtgac 


aaagcaggaa 


gtcaaagagt 


360 


tcttccgctg 


ggcgcaggat 


cacgtgaccg 


aggtggcgca 


tgagttctac 


gtcagaaagg 


420 


gtggagccaa 


caagagaccc 


gcccccgatg 


acgcggataa 


aagcgagccc 


aagcgggcct 


480 


gcccctcagt 


cgcggatcca 


tcgacgtcag 


acgcggaagg 


agctccggtg 


gactttgccg 


540 


acaggtacca 


aaacaaatgt 


tctcgtcacg 


cgggcatgct 


tcagatgctg 


tttccctgca 


600 


agacatgcga 


gagaatgaat 


cagaatttca 


acatttgctt 


cacgcacggg 


accggagact 


660 
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gttcagaatg 


tttccccggc 


gtgtcagaat 


ctcaaccggt 


cgtcagaaag 


aggacgtatc 


720 


ggaaactctg 


tgccattcat 


catctgctgg 


ggcgggctcc 


cgagattgct 


tgctcggcct 


780 


gcgatctggt 


caacgtggac 


ctggatgact 


gtgtttctga 


gcaataaatg 


acttaaacca 


840 


ggtatggctg 


ccgatggtta 


tcttccagat 


tggctcgagg 


acaacctctc 


tgagggcatt 


900 


cgcgagtggt 


gggacttgaa 


acctggagcc 


ccgaaaccca 


aagccaacca 


gcaaaagcag 


960 


gacgacggcc 


ggggtctggt 


gcttcctggc 


tacaagtacc 


tcggaccctt 


caacggactc 


1020 


gacaagggag 


agccggtcaa 


cgcggcggac 


gcagcggccc 


tcgagcacga 


caaggcctac 


1080 


gaccagcagc 


tcaaagcggg 


tgacaatccg 


tacctgcggt 


ataaccacgc 


cgacgccgag 


1140 


tttcaggagc 


gtcttcaaga 


agatacgtct 


tttgggggca 


acctcgggcg 


agcagtcttc 


1200 


caggccaaga 


agcgggttct 


cgaacctctc 


ggtctggttg 


aggaaggcgc 


taagacggct 


1260 


cctggaaaga 


agagacccat 


agaatccccc 


gactcctcca 


cgggcatcgg 


caagaaaggc 


1320 


cagcagcccg 


ctaaaaagaa 


gctcaacttt 


gggcagactg 


gcgactcaga 


gtcagtgccc 


1380 


gaccctcaac 


caatcggaga 


accccccgca 


ggcccctctg 


gtctgggatc 


tggtacaatg 


1440 


gctgcaggcg 


gtggcgctcc 


aatggcagac 


aataacgaag 


gcgccgacgg 


agtgggtaat 


1500 


gcctccggaa 


attggcattg 


cgattccaca 


tggctgggcg 


acagagtcat 


caccaccagc 


1560 


acccgcacct 


gggccctgcc 


cacctacaac 


aaccacctct 


acaagcagat 


atcaagtcag 


1620 


agcggggcta 


cc aacgacaa 


ccacttcttc 


ggctacagca 


ccccctgggg 


ctattttgac 


1680 


ttcaacagat 


tccactgcca 


cttctcacca 


cgtgactggc 


agcgactcat 


caacaacaac 


1740 


tggggattcc 


ggcccagaaa 


gctgcggttc 


aagttgttca 


acatccaggt 


caaggaggtc 


1800 


acgacgaacg 


acggcgttac 


gaccatcgct 


aataacctta 


ccagcacgat 


tcaggtcttc 


1860 


tcggactcgg 


agtaccaact 


gccgtacgtc 


ctcggctctg 


cgcaccaggg 


ctgcctccct 


1920 


ccgttccctg 


cggacgtgtt 


catgattcct 


cagtacggat 


atctgactct 


aaacaacggc 


1980 


agtcagtctg 


tgggacgttc 


ctccttctac 


tgcctggagt 


actttccttc 


tcagatgctg 


2040 


agaacgggca 


ataactttga 


attcagctac 


acctttgagg 


aagtgccttt 


ccacagcagc 


2100 


tatgcgcaca 


gccagagcct 


ggaccggctg 


atgaatcccc 


tcatcgacca 


gtacctgtac 


2160 


tacctggccc 


ggacccagag 


cactacgggg 


tccacaaggg 


agctgcagtt 


ccatcaggct 


2220 


gggcccaaca 


ccatggccga 


gcaatcaaag 


aactggctgc 


ccggaccctg 


ttatcggcgg 


2280 


cagagactgt 


caaaagacat 


agacagcaac 


aacaacagta 


actttgcctg 


gaccggggcc 


2340 


actaaatacc 


atctgaatgg 


tagaaattca 


ttaaccaacc 


cgggcgtagc 


catggccacc 


2400 


aacaaggacg 


acgaggacca 


gttctttccc 


atcaacggag 


tgctggtttt 


tggcaaaacg 


2460 


ggggctgcca 


acaagacaac 


gctggaaaac 


gtgctaatga 


ccagcgagga 


ggagatcaaa 


2520 


accaccaatc 


ccgtggctac 


agaagaatac 


ggtgtggtct 


ccagcaacct 


gcaatcgtct 


2580 
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acggccggac 


cccagacaca 


gactgtcaac 


agccaggggg 


ctctgcccgg 


catggtctgg 


2640 




cagaaccggg 


acgtgtacct 


gcagggtccc 


atctgggcca 


aaattcctca 


cacggacggc 


2700 


5 


aactttcacc 


cgtctcccct 


gatgggcgga 


tttggactca 


aacacccgcc 


tcctcaaatt 


2760 




ctcatcaaaa 


acaccccggt 


acctgctaat 


cctccagagg 


tgtttactcc 


tgccaagttt 


2820 


10 


gcctcattta 


tcacgcagta 


cagcaccggc 


caggtcagcg 


tggagatcga 


gtgggaactg 


2880 


cagaaagaga 


acagcaaacg 


ctggaatcca 


gagattcagt 


acacctcaaa 


ttatgccaag 


2940 




tctaataatg 


tggaatttgc 


tgtcaacaac 


gaaggggttt 


atactgagcc 


tcgccccatt 


3000 


15 


ggcacccgtt 


acctcacccg 


taacctgtaa 


ttacttgtta 


atcaataaac 


cggttgattc 


3060 


gtttcagttg 


aactttggtc 


tctgcgaagg 


gcgaattc 






3098 


20 


<210> 38 
<211> 3276 
<212> DNA 

<213> new AA.V serotype, clone 42.6a 










<400> 38 
gaattcgccc 


ttcgcagaga 


ccaaagttca 


actgaaacga 


attaaccggt 


ttattgatta 


60 


25 


acaggcaatt 


acaggttacg 


ggtgaggtaa 


cgggtgccaa 


tggggcgagg 


ctcagtataa 


120 




accccttcgt 


tgttgacagc 


aaattccaca 


ttattagact 


tggcataatt 


tgaggtgtac 


180 




tgaatctctg 


gattccagcg 


tttgctgttt 


tctttctgca 


gttcccactc 


gatctccacg 


240 


30 


ctgacctggc 


cggtgctgta 


ctgcgtgata 


aatgaggcaa 


acttggcagg 


agtaaacacc 


300 




tctggaggat 


tagcaggtac 


cggggtgttt 


ttgatgagaa 


tttgaggagg 


cgggtgtttg 


360 




agtccaaatc 


cgtccatcag 


gggagacggg 


tgaaagttgc 


cgtccgtgtg 


aggaattttg 


420 


35 


gcccagatgg 


gaccctgcag 


gtacacgtcc 


cggttctgcc 


agaccatgcc 


gggcagagcc 


480 




ccctggctgt 


tgacagtctg 


tgtctggggt 


ccggccgtag 


acgattgcag 


gttgctggag 


540 




accacaccgt 


attcttctgt 


agccacggga 


ttggtggttt 


tgatctcctc 


ctcgctggtc 


600 


40 


attagcacgt 


tttccagcgt 


tgtcttgttg 


gcagcccccg 


ttttgccaaa 


aaccagcact 


660 




ccgttgatgg 


gaaagaactg 


gtcctcgtcg 


tccttgttgg 


tggccatggc 


tacgcccggg 


720 




ttggttaatg 


aatttctacc 


attcacatgg 


tatttagtgg 


ccccggtcca 


ggcaaagtta 


780 


45 


ctgttgttgt 


tgctgtctat 


gttttttgac 


agtctctgct 


gccgataaca 


gggtccgggc 


840 




agccagttct 


ttgattgctc 


ggccatggtg 


ttgggcccag 


cctgatggaa 


ctgcagctcc 


900 


50 


cttgtggacc 


ccgtagtgct 


ctgggtccgg 


gccaggtagt 


acaggtactg 


gtcgatgagg 


960 


ggattcatca 


gccggtccag 


gctctggcta 


tgcgcatagc 


tgctgtggaa 


aggcacttcc 


1020 




tcaaaggtgt 


agctgaattc 


aaagttattg 


cccgttctca 


gcatctgaga 


aggaaagtac 


1080 


55 


tccaggcagt 


agaaggagga 


acgtcccaca 


gactgactgc 


cgttgtttag 


agtcagatat 


1140 


ccgtactgag 


gaatcatgaa 


cacgtccgca 


gggaacggag 


ggaggcagcc 


ctggtgcgca 


1200 
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gagccgagga cgtacggcag ttggtactcc gagtccgaga agacctgaat cgtgctggta 12 60 
aggttattag cgatggtcgt aacgccgtcg tccgtcgtga cctccttgac ctggatgttg 1320 
aacaacttga accgcagctt tctgggccgg aatccccagt tgttgttgat gagtcgctgc 138 0 

cagtcacgtg gtgagaagtg gcagtggaat ctgttaaagt caaaataccc ccagggggtg 1440 

ctgtagccga agtaggtgtt gtcgttggtg cttcctcccg atgtcccgtt ggagatttgc 1500 

ttgtagaggt ggttgttgta ggtggggagg gcccaggttc gggtgctggt ggtgatgact 1560 

ctgtcgccca gccatgtgga atcgcaatgc caatttcctg aggaactacc cactccgtcg 1620 

gcgccttcgt tattgtctgc cattggagcg ccaccgcctg cagccattgt accagatccc 1680 

agaccagagg ggcctgcggg gggttctccg attggttgag ggtcgggcac tgactctgag 1740 

tcgccagtct gcccaaagtt gagtctcttt ttcgcgggct gctggcctgt cttgccgatg 1800 

cccgtagagg agtctggaga acgctggggt gatggctcta ccggtctctt ctttccagga 1860 

gccgtcttag cgccttcctc aaccagaccg agaggttcga gaacccgctt cttggcctgg 1920 

aagactgctc gcccgaggtt gcccccaaaa gacgtatctt cttgaagacg ctcctgaaac 1930 

tcggcgtcgg cgtggttgta cttgaggtac gggttgtccc cctgctcgag ctgcttgtcg 2040 

taggccttgt cgtgctcgag ggccgcggcg tctgcctcgt tgaccggctc tcccttgtcg 2100 

agtccgttga agggtccgag gtacttgtag ccaggaagca ccagaccccg gccgtcgtcc 2160 

tgcttttgct ggttggcttt gggtttcggg gctccaggtt tcaagtccca ccactcgcga 2220 

atgccctcag agaggttgtc ctcgagccaa tctggaagat aaccatcggc agccatacct 228 0 

ggtttaagtc atttattgct cagaaacaca gtcatccagg tccacgttga ccagatcgca 2340 

35 ggccgagcaa gcaatctcgg gagcccgccc cagcagatga tgaatggcac agagtttccg 2400 

atacgtcctc tttctgacga ccggttgaga ttctgacacg ccggggaaac attctgaaca 2460 

gtctctggtc ccgtgcgtga agcaaatgtt gaaattctga ttcattctct cgcatgtctt 2520 

40 gcagggaaac agcatctgaa gcatgcccgc gtgacgagaa cacttgtttt ggtacctgtc 2580 

ggcaaagtcc accggagctc cttccgcgtc tgacgtcgat ggatgcaaaa tgtcgcaaaa 2640 

gcactcacgt gacagctaat acaggaccac tcccctatga cgtgatttac gtcagcgcta 2700 

45 tgcccgcgtg acgagaacat ttgttttggt acctgtcggc aaagtccacc ggagctcctt 2760 

ccgcgtctga cgtcgatgga tccgcgactg aggggcaggc ccgcttgggc tcgcttttat 2820 

ccgcgtcatc gggggcgggt ctcttgttgg ctccaccctt tctgacgtag aactcatgcg 2880 

50 ccacctcggt cacgtgatcc tgcgcccagc ggaagaactc tttgacttcc tgctttgtca 2940 

ccttgccaaa gtcatgctcc agacggcggg tgagttcaaa tttgaacatc cggtcctgca 3000 

acggctgctg gtgctcgaag gtggtgctgt tcccgtcaat cacggcgcac atgttggtgt 3060 

55 tggaagtgac gatcacgggg gtgggatcga tctgggcgga agacttgcac ttttggtcca 3120 
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cgcgcacctt gctgccgccg agaatggcct tggcggactc cacgaccttg gccgtcatct 3180 
tgccctcctc ccaccagatc accatcttgt cgacgcaatc gttgaaggga aagttctcat 3240 

5 

tggtccagtt gacgcagccg tagaaagggc gaattc 3276 

<210> 39 

<211> 3084 

10 <212> DNA 

<213> new AAV serotype, clone 43.1 

<400> 39 

gaattcgccc tttctacggc tgcatcaact ggaccaatga gaactttccc ttcaacgatt 60 

gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 12 0 

ccgccaaggc cattctcggc ggcagcaagg tgcgcgtgga ccaaaagtgc aagtcgtccg 180 

cccagatcga ccccaccccc gtgatcgtca cctccaacac caacatgtgc gccgtgattg 240 

acgggaacag caccaccttc gagcaccagc agccgttgca ggaccggatg ttcaagttcg 300 

aactcacccg ccgtctggag cacgactttg gcaaggtgac caagcaggaa gtcaaagagt 360 

tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca tgagttctac gtcagaaagg 420 



20 



25 



30 



35 



40 



45 



50 



55 



480 



gcggagccag caaaagaccc gcccccgatg acgcggatat aagcgagccc aagcgggcct 

gcccctcagt cgcggatcca tcgacgtcag acgcggaagg agctccggtg gactttgccg 540 

acaggtacca aaacaaatgt tctcgtcacg cgggcatgct tcagatgctg tttccctgca 600 

aaacgtgcga gaaaatgaat cagaatttca acatttgctt cacgcacggg gtcagagact 660 

gctcagaatg tttccccggt gcatcagaat ctcaaccggt cgtcagaaaa aaaacgtatc 720 

agaaactgtg tgccattcat catctgctgg ggcgggcacc cgagattgct tgctcggcct 780 

gcgatctggt caacgtggac ctggacgact gtgtttctga gcaataaatg acttaaacca 840 

ggtatggctg ccgatggtta tcttccagat tggcttgagg acaacctctc tgagggcatt 900 

cgcgagtggt gggacctgaa acctggagcc ccgaaaccca aagccaacca gcaaaagcag 960 

gacgacggcc ggggtctggt gcttcctggc tacaagtacc tcggaccctt caacggactc 1020 

gacaaggggg agcccgtcaa cgcggcggac gcagcggccc tcgagcacga caaggcctac 1080 

gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataaccacgc cgacgccgag 114 0 

tttcaggagc gtctgcaaga agatacgtct tttgggggca acctcgggcg agcagtcttc 1200 

caggccaaga agcgggttct cgaacctctc ggtctggttg aggaaggcgc taagacggct 1260 

cctggaaaga agagaccggt agagccatca cctcagcgtt cccccgactc ctccacgggc 1320 

atcggcaaga aaggccacca gcccgcgaga aagagactga actttgggca gactggcgac 1380 

tcggagtcag tccccgaccc tcaaccaatc ggagaaccac cagcaggccc ctctggtctg 1440 

ggatctggta caatggctgc aggcggtggc gctccaatgg cagacaataa cgaaggcgcc 1500 

gacggagtgg gtagttcctc aggaaattgg cattgcgatt ccacatggct gggcgacaga 1560 
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gtcatcacca 


ccagcacccg 


aacctgggcc 


ctgcccacct 


acaacaacca 


tctctacaag 


1620 


caaatctcca 


acgggacatc 


gggaggaagc 


actaacgaca 


acacctactt 


tggctacagc 


1680 


accccctggg 


ggtattttga 


cttcaacaga 


ttccactgcc 


acttctcacc 


acgtgactgg 


1740 


cagcgactca 


tcaacaataa 


ctggggattc 


cggcccaaga 


gactcaactt 


caagctcttc 


1800 


aacatccagg 


tcaaggaggt 


cacgcagaat 


gaaggcacca 


agaccatcgc 


caataacctt 


1860 


accagcacga 


ttcaggtgtt 


tacggactcg 


gaataccagc 


tcccgtacgt 


ccccggctct 


1920 


gcgcaccagg 


gctgcctccc 


tccgttcccg 


gcggacgtct 


tcatgattcc 


tcagtacggg 


1980 


tatctgaccc 


taaacaatgg 


cagtcaggct 


gtgggccgtt 


cctccttcta 


ctgcctggaa 


2040 


tacttccctt 


ctcaaatgct 


gaggacgggc 


aacaactttg 


aattcagcta 


caccttcgag 


2100 


gacgtgcctt 


tccacagcag 


ctacgcgcac 


agccagagcc 


tggaccggct 


gatgaaccct 


2160 


ctcatcgacc 


agtacctgta 


ttacttatcc 


agaactcagt 


ccacaggagg 


aactcaaggt 


2220 


actcagcaat 


tgttattttc 


tcaagccggg 


cccgcaaaca 


tgtcggctca 


ggccaagaac 


2280 


tggctacctg 


gaccgtgtta 


ccgtcagcaa 


cgagtttcca 


cgacactgtc 


gcaaaacaac 


2340 


aacagcaatt 


ttgcttggac 


cggtgccacc 


aagtatcacc 


tgaatggcag 


agactccctg 


2400 


gttaatcccg 


gcgttgccat 


ggctacccac 


aaggacgacg 


aggagcgctt 


cttcccgtca 


2460 


agcggagttc 


taatgtttgg 


caagcagggg 


gctggaaaag 


acaatgtgga 


ctacagcagc 


2520 


gtgatgctca 


ccagcgaaga 


agaaattaaa 


actactaacc 


cagtggctac 


agagcagtat 


2580 


ggtgtggtgg 


cagacaacct 


gcagcagacc 


aacggagctc 


ccattgtggg 


aactgtcaac 


2640 


agccaggggg 


ccttacctgg 


tatggtctgg 


caaaaccggg 


acgtgtacct 


gcagggcccc 


2700 


atctgggcca 


aaattcctca 


cacggacggc 


aactttcatc 


cttcgccgct 


gatgggaggc 


2760 


tttggactga 


aacacccgcc 


tcctcagatc 


ctggtgaaaa 


acactcctgt 


tcctgcggat 


2820 


cctccgacca 


ccttcagcca 


ggccaagctg 


gcttctttta 


tcacgcagta 


cagcaccgga 


2880 


caggtcagcg 


tggaaatcga 


atgggagctg 


cagaaagaaa 


acagcaagcg 


ctggaaccca 


2940 


gagattcagt 


atacttccaa 


ctactacaaa 


tctacaaatg 


tggactttgc 


tgtcaatact 


3000 


gagggtactt 


attcagagcc 


tcgccccatt 


ggcactcgtt 


atctcacccg 


taatctgtaa 


3060 


ttgcttgtta 


atcaataaac 


cggt 








3084 



<210> 40 
<211> 2370 
<212> DNA 

<213> new AAV serotype, clone 43.5 
<400> 40 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 60 
gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 120 
ccgccaaggc cattctcggc ggcagcaagg tgcgcgtgga ccaaaagtgc aagtcgtccg 18 0 
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cccagatcga 


ccccaccccc 


gtgatcgtca 


cctccaacac 


caacatgtgc 


gccgtgattg 


240 


acgggaacag 


caccaccttc 


gagcaccagc 


agccgttgca 


ggaccggatg 


ttcaagttcg 


300 


aactcacccg 


ccgtctggag 


cacgactttg 


gcaaggtgac 


caagcaggaa 


gtcaaagagt 


360 


tcttccgctg 


ggcgcaggat 


cacgtgaccg 


aggtggcgca 


tgagttctac 


gtcagaaagg 


420 


gcggagccag 


caaaagaccc 


gcccccgatg 


acgcggatat 


aagcgagccc 


aagcgggcct 


480 


gcccctcagt 


cgcggatcca 


tcgacgtcag 


acgcggaagg 


agctccggtg 


gactttgccg 


540 


acaggtacca 


aaacaaatgt 


tctcg::cacg 


cgggcatgct 


tcagacgctg 


tttccctgca 


600 


aaacgtgcga 


gagaatgaat 


cagaarttca 


acatttgctt 


cacgcacggg 


gtcagagact 


660 


gctcagaatg 


tttccccggt 


gcatcagaat 


ctcaaccggt 


cgtcagaaaa 


aaaacgtatc 


720 


agaaactgtg 


tgccattcat 


catctgctgg 


ggcgggcacc 


cgagattgct 


tgctcggcc t 


780 


gcgatctggt 


caacgtggac 


ctggacgact 


gtgtttctga 


gcaataaatg 


acttaaacca 


840 


ggtatggctg 


ccgatggtta 


tcttccagat 


tggcttgagg 


acaacctctc 


tgagggcatt 


900 


cgcgagtggt 


gggacctgaa 


acctggagcc 


ccgaaaccca 


aagccaacca 


gcaaaagcag 


960 


gacgacggcc 


ggggtctggt 


gcttcctggc 


tacaagtacc 


tcggaccctt 


caacggactc 


1020 


gacaaggggg 


agcccgtcaa 


cgcggcggac 


gcagcggccc 


tcgagcacga 


caaggcctac 


1080 


gaccagcagc 


tcaaagcggg 


tgacaatccg 


tacctgcggt 


ataaccacgc 


cgacgccgag 


1140 


tttcaggagc 


gtctgcaaga 


agatacgtct 


tttgggggca 


acctcgggcg 


agcagtcttc 


1200 


caggccaaga 


agcgggttct 


cgaacctctc 


ggtctggttg 


aggaaggcgc 


taagacggct 


1260 


cctggaaaga 


agagaccggt 


agagccatca 


cctcagcgtt 


cccccgactc 


ctccacgggc 


1320 


atcggcaaga 


aaggccacca 


gcccgcgaga 


aagagactga 


actttgggca 


gactggcgac 


1360 


tcggagtcag 


tccccgaccc 


tcaaccaatc 


ggagaaccac 


cagcaggccc 


ctctggtctg 


1440 


ggatctggta 


caatggctgc 


aggcggtggc 


gctccaatgg 


cagacaataa 


cgaaggcgcc 


15C0 


gacggagtgg 


gtagttcctc 


aggaaattgg 


cattgcgatt 


ccacatggct 


gggcgacaga 


1560 


gtcatcacca 


ccagcacccg 


aacctgggcc 


ctgcccacct 


acaacaacca 


tctctacaag 


1620 


caaatctcca 


acgggacatc 


gggaggaagc 


actaacgaca 


acacctactt 


tggctacagc 


1680 


accccctggg 


ggtattttga 


cttcaacaga 


ttccactgcc 


acttctcacc 


acgtgactgg 


1740 


cagcgactca 


tcaacaataa 


ctggggattc 


cggcccaaga 


gactcaactt 


caagctcttc 


1800 


aacatccagg 


tcaaggaggt 


cacgcagaat 


gaaggcacca 


agaccatcgc 


caat aacc tt 


1860 


accagcacga 


ttcaggtgtt 


tacggactcg 


gaataccagc 


tcccgtacgt 


cctcggctct 


1920 


gcgcaccagg 


gctgcctccc 


tccgttcccg 


gcggacgtct 


tcatgattcc 


tcagtacggg 


1980 


tatctgaccc 


taaacaatgg 


cagtcaggct 


gtgggccgtt 


cctccttcta 


ctgcctggaa 


2040 


tacttccctt 


ctcaaatgct 


gaggacgggc 


aacaactttg 


aattcagcta 


caccttcgag 


2100 
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gacgtgcctt tccacagcag ctacgcgcac agccagagcc tggaccggct gatgaaccct 2160 

ctcatcgacc agtacctgta ttacttatcc agaactcagt ccacaggagg aactcaaggt 2220 

actcagcaat tgttattttc tcaagccggg cccgcaaaca tgtyggctca ggccaagaac 2280 

tggctacctg gaccgtgtta ccgtcagcaa cgagtttcca cgacactgtc gcaaaacaac 2340 

aacagcaatt ttgctggacc ggtgccacca 2370 

<210> 41 
<211> 3123 
<212> DNA 

<213> new AAV serotype, clone 43.12 
<400> 41 

gaattcgccc ttggctgcgt caactggacc aatgagaact ttcccttcaa cgattgcgtc 60 

gacaagatgg tgatctggtg ggaggagggc aagatgacgg ccaaggtcgt ggagtccgcc 120 

aaggccattc tcggcggcag caaggtgcgc gtggaccaaa agtgcaagtc gtccgcccag 18 0 

atcgacccca cccccgtgat cgtcacctcc aacaccaaca tgtgcgccgt gattgacggg 240 

aacagcacca ccttcgagca ccagcagccg ttgcaggacc ggatgttcaa gttcgaactc 300 

acccgccgtc tggagcacga ctttggcaag gtgaccaagc aggaagtcaa agagttcttc 360 

cgctgggcgc aggatcacgt gaccgaggtg gcgcatgagt tctacgtcag aaagggcgga 420 

gccagcaaaa gacccgcccc cgatgacgcg gatataagcg agcccaagcg ggcctgcccc 480 

tcagtcgcgg atccatcgac gtcacacgcg gaaggagctc cggtggactt tgccgacagg 540 

taccaaaaca aatgttctcg tcacgcgggc atgctccaga tgctgtttcc ctgcaaaacg 600 

tgcgagagaa tgaatcagaa tttcaacatt tgcttcacgc acggggtcag agactgctca 660 

gaatgtttcc ccggtgcatc agaatctcaa ccggtcgtca gaaaaaaaac gtatcagaaa 720 

ctgtgtgcca ttcatcatct gctggggcgg gcacccgaga ttgcttgctc ggcctgcgat 780 

ctggtcaacg tggacctgga cgactgtgtt tctgagcaat aaatgactta aaccaggtat 840 

ggctgccgat ggttatcttc cagattggct tgaggacaac ctctctgagg gcattcgcga 900 

gtggtgggac ctgaaacctg gagccccgaa acccaaagcc aaccagcaaa agcaggacga 960 

cggccggggt ctggtgcttc ctggctacaa gtacctcgga cccttcaacg gactcgacaa 1020 

gggggagccc gtcaacgcgg cggacgcagc ggccctcgag cacgacaagg cctacgacca 108 0 

gcagctcaaa gcgggtgaca atccgtacct gcggtataac cacgccgacg ccgagtttca 114 0 

ggagcgtctg caagaagata cgtcttttgg gggcaacctc gggcgagcag tcttccaggc 1200 

caagaagcgg gttctcgaac ctctcggtct ggttgaggaa ggcgctaaga cggctcctgg 1260 

aaagaagaga ccggtagagc catcacctca gcgttccccc gactcctcca cgggcatcgg 1320 

caagaaaggc caccagcccg cgagaaagag actgaacttt gggcagactg gcgactcgga 1380 

gtcagtcccc gaccctcaac caatcggaga accaccagca ggcccctctg gtctgggatc 1440 
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<400> 42 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 60 

5 gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 120 

ccgccaaggc cattctcggc ggcagcaagg tgcgtgtgga ccaaaagtgc aagtcttccg 180 

cccagatcga tcccaccccc gtgatcgtca cctccaacac caacatgtgc gccgtgattg 240 

10 acgggaacag cgccaccttc gagcaccagc agccgttgca ggaccggatg ttcaaatttg 300 

aactcacccg ccgtctggag catgactttg gcaaggtgac gaagcaggaa gtcaaagagt 360 

tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca tgagttccac gtcagaaagg 420 

^5 gtggagccaa caagagaccc gcccccgatg acgcggatat aagcgagccc aagcgggcct 480 

gcccctcagt cgcggatcca tcgacgtcag acgcggaagg agctccggtg gactttgccg 540 

acaggtacca aaacaaatgt tctcgtcacg cgggcatgct tcagatgctg tttccctgca 600 

agacatgcga gagaatgaat cagaatttca acatttgctt cacgcacggg accagagact 660 

gttcagaatg tttccccggc gtgtcagaat ctcaaccggt cgtcagaaag aggacgtatc 720 

ggaaactctg tgcgattcat catctgctgg ggcgggctcc cgagattgct tgctcggcct 780 

gcgatctggt caacgtggac ctggatgact gtgtttctga gcaataaatg actt aaacca 840 

ggtatggctg ccgatggtta tcttccagat tggctcgagg acaacctctc tgagggcatt 900 

cgcgagtggt gggacttgaa acctggagcc ccgaaaccca aagccaacca gcaaaagcag 960 

gacgacggcc ggggtctggt gcttcctggc tacaagtacc tcggaccctt caacggactc 1020 

gacaaggggg agcccgtcaa cgcggcggac gcagcggccc tcgagcacga caaagcctac 1080 

gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataatcacgc cgacgccgag 1140 

tttcaggagc gtctgcaaga agatacgtct tttgggggca acctcgggcg agcagtcttc 1200 

caggccaaga agcgggttct cgaacctctc ggtctggttg aggaaggcgc taagacggct 1250 

cctggaaaga agagactggt agagcagtcg ccacaagagc cagactcctc ctcgggcatc 1320 

ggcaagacag gccagcagcc cgctaaaaag agactcaatt ttggtcagac tggcgactca 138 0 

gagtcagtcc ccgacccaca acctctcgga gaacctccag cagccccctc aggtctggga 1440 

cctaatacaa tggcttcagg cggtggcgct ccaatggcag acaataacga aggcgccgac 1500 

ggagtgggta attcctcggg aaattggcat tgcgattcca catggctggg ggacagagtc 1560 

atcaccacca gcacccgaac ctgggccctg cccacctaca acaaccacct ctacaagcaa 1620 

atctccaacg gcacctcggg aggaagcacc aacgacaaca cctattttgg ctacagcacc 1680 

ccctgggggt attttgactt caacagattc cactgtcact tttcaccacg tgactggcaa 174 0 

cgactcatca acaacaattg gggattccgg cccaaaagac tcaacttcaa gctgttcaac 1800 

atccaggtca aggaagtcac gacgaacgaa ggcaccaaga ccatcgccaa taatctcacc 1860 

agcaccgtgc aggtctttac ggactcggag taccagttac cgtacgtgct aggatccgct 1920 
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<211> 3117 
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<213> new AAV serotype, clone 43. 
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2760 


gcacccgcct 
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ggaaatcgag 
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agattcaata 
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cacttccaac 
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ggactttgct 
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25 <210> 44 

<211> 3121 

<212> DNA 

<213> new AAV serotype, clone 43.23 



<400> 44 

gaattcgccc ttctacggct gcgtcaactg gaccaatgag aactttccct tcaacgattg 60 

cgtcgacaag atggtgatct ggtgggagga gggcaagatg acggccaagg tcgtggagtc 120 

cgccaaggcc attctcggcg gcagcaaggt gcgtgtggac caaaagtgca agtcttccgc 180 

ccagatcgat cccacccccg tgatcgtcac ctccaacacc aacatgtgcg ccgtgattga 240 

cgggaacagc accaccttcg agcaccagca gccgttgcag gaccggatgt tcaaatttga 300 

actcacccgc cgtctggagc atgactttgg caaggtgacg aagcaggaag tcaaagagtt 360 

cttccgctgg gcgcaggatc acgtgaccga ggtggcgcat gagttccacg tcagaaaggg 420 

tggcgccaac aagagacccg cccccgatga cgcggatata agcgagccca agcgggcctg 480 

cccctcagtc gcggatccat cgacgtcaga cgcggaagga gctccggtgg actttgccga 540 

caggtaccaa aacaaatgtt ctcgtcacgc gggcatgctt cagatgctgt ttccctgcaa 600 

gacatgcgag agaatgaatc agaatttcaa catttgcttc acgcacggga ccagagactg 660 

ttcagaatgt ttccccggcg tgtcagaatc tcaaccggtc gtcagaaaga ggacgtatcg 720 

gaaactctgt gcgattcatc atctgctggg gcgggctccc gagattgctt gctcggcctg 780 

cgatctggtc aacgtggacc tggatgactg tgtttctgag caataaatga cttaaaccag 840 

gtatggctgc cgatggttat cttccagatt ggctcgagga caacctctct gagggcattc 900 

gcgagtggtg ggacttgaaa cctggagccc cgaaacccaa agccaaccag caaaagcagg 960 
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aacctccagc 


agccccctca 


ggtctgggac 


1440 


ctaatacaat 


ggcttcaggc 


ggtggcgctc 


caatggcaga 


caataacgaa 


ggcgccgacg 


1500 


gagtgggtaa 


ttcctcggga 


aattggcatt 


gcgattccac 


atggctgggg 


gacagagtca 


1560 


tcaccaccag 


cacccgaacc 


tgggccctgc 


ccacctacaa 


caaccacctc 


tacaagcaaa 


1620 


tctccaacgg 


cacctcggga 


ggaagcacca 


acgacaacac 


ctattttggc 


tacagcaccc 


1680 


cctgggggta 


ttttgacttc 


aacagattcc 


actgtcactt 


ttcaccacgt 


gactggcaac 


1740 


gactcatcaa 


caacaattgg 


ggattccggc 


ccaaaagact 


caacttcaag 


ctgttcaaca 


1800 


tccaggtcaa 


ggaagtcacg 


acgaacgaag 


gcaccaagac 


catcgccaat 


aatctcacca 


1860 


gcaccgtgca 


ggtctttacg 


gacttggagt 


accagttacc 


gtacgtgcta 


ggatccgctc 


1920 


accagggatg 


tctgcctccg 


ttcccggcgg 


acgtcttcat 


ggttcctcag 


tacggctatt 


1980 


taactttaaa 


caatggaagc 


caagccctgg 


gacgttcctc 


cttctactgt 


ctggagtatt 


2040 


tcccatcgca 


gatgccgaga 


accggcaaca 


actttcagtt 


cagctacacc 


ttcgaggacg 


2100 


tgcctttcca 


cagcagctac 


gcgcacagcc 


agagcctgga 


caggctgatg 


aatcccctca 


2160 


tcgaccagta 


cctgtactac 


ctggtcagaa 


cgcaaacgac 


tggaactgga 


gggacgcaga 


2220 


ctctggcatt 


cagccaagcg 


ggtcctagct 


caatggccaa 


ccaggctaga 


aattgggtgc 


2280 


ccggaccttg 


ctaccggcag 


cagcgcgtct 


ccacgacaac 


caaccagaac 


aacaacagca 


2340 


actttgcctg 


gacgggagct 


gccaagttta 


agctgaacgg 


ccgagactct 


ctaatgaatc 


24C0 


cgggcgtggc 


aatggcttcc 


cacaaggatg 


acgacgaccg 


cttcttccct 


tcgagcgggg 


2460 


tcctgatttt 


tggcaagcaa 


ggagccggga 


acgatggagt 


ggattacagc 


caagtgctga 


2520 


ttacagatga 


ggaagaaatc 


aaggctacca 


accccgtggc 


cacagaagaa 


tatggagcag 


2580 


tggccatcaa 


caaccaggcc 


gccaatacgc 


aggcgcagac 


cggactcgtg 


cacaaccagg 


2640 


gggtgattcc 


cggcatggtg 


tggcagaata 


gagacgtgta 


cctgcagggt 


cccatctggg 


2700 


ccaaaattcc 


tcacacggac 


ggcaactttc 


acccgtctcc 


cctgatgggc 


ggctttggac 


2760 


tgaagcaccc 


gcctcctcaa 


attctcatca 


agaacacacc 


ggttccagcg 


gacccgccgc 


2820 


ttaccttcaa 


ccaggccaag 


ctgaactctt 


tcatcacgca 


gtacagcacc 


ggacaggtca 


2880 
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gcgtggaaat 


cgagtgggag 


ctgcagaaag 


aaaacagcaa 


acgctggaat 


ccagagattc 


2940 


aatacacttc 


caactactac 


aaatctacaa 


atgtggactt 


tgctgtcaac 


acggaaggag 


3000 


tttatagcga 


gcctcgcccc 


attggcaccc 


gttacctcac 


ccgcaacctg 


taattacatg 


3060 


ttaatcaata 


aaccggttaa 


ttcgtttcag 


ttgaactttg 


gtctctgcga 


agggcgaatt 


3120 


c 
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<211> 3122 
<212> DNA 

<213> new AAV serotype, clone 43.25 








<400> 45 
gaattcgccc 


tttctacggc 


tgcgtcaact 


ggaccaatga 


gaactttccc 


ttcaacgatt 


60 


gcgtcgacaa 


gatggtgatc 


tggtgggagg 


agggcaagat 


gacggccaag 


gtcgtggagt 


120 


ccgccaaggc 


cattctcggc 


ggcagcaagg 


tgcgtgtgga 


ccaaaagtgc 


aagtcttccg 


180 


cccagatcga 


tcccaccccc 


gtgatcgtca 


cctccaacac 


caacatgtgc 


gccgtgattg 


240 


acgggaacag 


caccaccttc 


gagcaccagc 


agccgttgca 


ggaccggatg 


ttcaaatttg 


300 


aactcacccg 


ccgtctggag 


catgactttg 


gcaaggtgac 


gaagcaggaa 


gtcaaagggt 


360 


tcttccgctg 


ggcgcaggat 


cacgtgaccg 


aggtggcgca 


tgagttccac 


gtgcgagccc 


420 


aagcgggcct 


gcccctcagt 


cgcggatcca 


tcgacgtcag 


accagaaagg 


gtggagccaa 


480 


caagagaccc 


gcccccgatg 


acgcggatat 


aagcggaagg 


agctccggtg 


gactttgccg 


540 


acaggtacca 


aaacaaatgt 


tctcgtcacg 


cgggcatgct 


tcagatgctg 


tttccctgca 


600 


agacatgcga 


gagaatgaat 


cagaatttca 


acatttgctt 


cacgcacggg 


accagagact 


660 


gttcagaatg 


tttccccggc 


gtgtcagaat 


ctcaaccggt 


cgtcagaaag 


aggacgtatc 


720 


ggaaactctg 


tgcgattcat 


catctgctgg 


ggcgggctcc 


cgagattgct 


tgctcggcct 


780 


gcgatctggt 


caacgtggac 


ctggatgact 


gtgtttctga 


gcaataaatg 


acttaaacca 


840 


ggtatggctg 


ccgatggtta 


tcttccagat 


tggctcgagg 


acaacctctc 


tgagggcatt 


900 


cgcgagtggt 


gggacttgaa 


acctggagcc 


ccgaaaccca 


aagccaacca 


gcaaaagcag 


960 


gacgacggcc 


ggggtctggt 


gcttcctggc 


tacaagtacc 


tcggaccctt 


caacggactc 


1020 


gacaaggggg 


agcccgtcaa 


cgcggcggac 


gcagcggccc 


tcgagcacga 


caaagcctac 


1080 


gaccagcagc 


tcaaagcggg 


tgacaatccg 


tacctgcggt 


ataatcacgc 


cgacgccgag 


1140 


tttcaggagc 


gtctgcaaga 


agatacgtct 


tttgggggca 


acctcgggcg 


agcagtcttc 


1200 


caggccaaga 


agcgggttct 


cgaacctctc 


ggtctggttg 


aggaaggcgc 


taagacggct 


1260 


cctggaaaga 


agagaccggt 


agagcagtcg 


ccacaagagc 


cagactcctc 


ctcgggcatc 


1320 


ggcaagacag 


gccagcagcc 


cgctaaaaag 


agactcaatt 


ttggtcagac 


tggcgactca 


1380 


gagtcagtcc 


ccgacccaca 


acctctcgga 


gaacctccag 


cagccccctc 


aggtctggga 


1440 
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55 



cctaatacaa 


tggcttcagg 


cggtggcgct 


ccaatggcag 


acaataacga 


aggcgccgac 


1500 


ggagtgggta 


attcctcggg 


aaattggcat 


tgcgattcca 


catggctggg 


ggacagagtc 


1560 


atcaccacca 


gcacccgaac 


ctgggccctg 


cccacctaca 


acaaccacct 


ctacaagcaa 


1620 


atctccaacg 


gcacctcggg 


aggaagcacc 


aacgacaaca 


cctattttgg 


ctacagcacc 


1680 


ccctgggggt 


attttgactt 


caacagattc 


cactgtcact 


tttcaccacg 


tgactggcaa 


1740 


cgactcatca 


acaacaattg 


gggattccgg 


cccaaaagac 


tcaacttcaa 


gctgttcaac 


1800 


atccaggtca 


aggaagtcac 


gacgaacgaa 


ggcaccaaga 


ccatcgccaa 


taatctcacc 


1860 


agcaccgtgc 


aggtctttac 


ggactcggag 


taccagttac 


cgtacgtgct 


aggatccgct 


1920 


caccagggat 


gtctgcctcc 


gttcccggcg 


gacgtcttca 


tggttcctca 


gtacggctat 


1980 


ttaactttaa 


acaatggaag 


ccaagccctg 


ggacgttcct 


ccttctactg 


tctggagtat 


2040 


ttcccatcgc 


agatgctgag 


aaccggcaac 


aactttcagt 


tcagctacac 


cttcgaggac 


2100 


gtgcctttcc 


acagcagcta 


cgcgcacagc 


cagagcctgg 


acaggctgat 


gaatcccctc 


2160 


atcgaccagt 


acctgtacta 


cctggtcaga 


acgcaaacga 


ctggaactgg 


agggacgcag 


2220 


actctggcat 


tcagccaagc 


gggtcctagc 


tcaatggcca 


accaggctag 


aaattgggtg 


2280 


cccggacctt 


gctaccggca 


gcagcgcgtc 


tccacgacaa 


ccaaccagaa 


caacaacagc 


2340 


aactttgcct 


ggacgggagc 


tgccaagttt 


aagctgaacg 


gccgagactc 


tctaatgaat 


2400 


ccgggcgtgg 


caatggcttc 


ccacaaggat 


gacgacgacc 


gcttcttccc 


ttcgagcggg 


2460 


gtcctgattt 


ttggcaagca 


aggagccggg 


aacgatggag 


tggattacag 


ccaagtgctg 


2520 


attacagatg 


aggaagaaat 


caaggctacc 


aaccccgtgg 


ccacagaaga 


atatggagca 


2580 


gtggccatca 


acaaccaggc 


cgccaatacg 


caggcgcaga 


ccggactcgt 


gcacaaccag 


2640 


ggggtgattc 


ccggcatggt 


gtggcagaat 


agagacgtgt 


acctgcaggg 


tcccatctgg 


2700 


gccaaaattc 


ctcacacgga 


cggcaacttt 


cacccgtctc 


ccctgatggg 


cggctttgga 


2760 


ctgaagcacc 


cgcctcctca 


aattctcatc 


aagaacacac 


cggttccagc 


ggacccgccg 


2820 


cttaccttca 


accaggccaa 


gctgaactct 


ttcatcacgc 


agtacagcac 


cggacaggtc 


2880 


agcgtggaaa 


tcgagtggga 


gctgcagaaa 


gaaaacagca 


aacgctggaa 


tccagagatt 


2940 


caatacactt 


ccaactacta 


caaatctaca 


aatgtggact 


ttgctgtcaa 


cacggagggg 


3000 


gtttatagcg 


agcctcgccc 


cattggcacc 


cgttacctca 


cccgcaacct 


gtaattacat 


3060 


gttaatcaat 


aaaccggtta 


attcgtttca 


gttgaacttt 


ggtctctgcg 


aagggcgaat 


3120 


tc 
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gaattcgccc 


tttctacggc 


tgcgtcaact 


ggaccaatga 


gaactttccc 


ttcaacgatt 


60 


5 


gcgtcgacaa 


gatgttgatc 


tggtgggagg 


agggcaagat 


gacggccaag 


gtcgtggagt 


120 




ccgccaaggc 


cattctcggc 


ggcagcaaag 


tgcgcgtgga 


ccaaaagtgc 


aagccgtccg 


180 




cccagatcga 


ccccaccccc 


gtgatcgtca 


cctccaacac 


caacatgtgc 


gccgtgattg 


240 


10 


acgggaacag 


caccaccttc 


gagcaccagc 


agccgttgcg 


ggaccggatg 


ttcaagtttg 


300 




aactcacccg 


ccgtctggag 


cacgactttg 


gcaaggtgac 


aaagcaggaa 


gtcagagagt 


360 




tcttccgctg 


ggcgcaggat 


cacgtgaccg 


aggtggcgca 


cgagttctac 


gtcagaaagg 


420 


15 


gtggagccaa 


caagagaccc 


gcccccgatg 


acgcggataa 


aagcgagccc 


aagcgggcct 


480 




gcccctcagt 


cgcggatcca 


tcgacgtcag 


acgcggaagg 


agctccggtg 


gactttgccg 


540 




acaggtacca 


aaacaaatgt 


tctcgtcacg 


cgggcatgct 


tcagatgctg 


tttccctgca 


600 


20 


aaacatgcga 


gagaatgaat 


cagaatttca 


acatttgctt 


cacgcacggg 


accagagact 


660 




gttcagaatg 


tttccccggc 


gtgtcagaat 


ctcaaccggt 


cgtcagaaaa 


aagacgtatc 


720 




ggaaactctg 


tgcgattcat 


catctgctgg 


ggcgggcacc 


cgagattgct 


tgctcggcct 


780 


25 


gcgatctggt 


caacgtggac 


ctagatgact 


gtgtttctga 


gcaataaatg 


acttaaacca 


840 




ggtatggctg 


ccgatggtta 


tcttccagat 


tggctcgagg 


acaacctctc 


tgagggcatt 


900 




cgcgagtggt 


gggacttgaa 


acctggagcc 


ccgaaaccca 


aagccaacca 


gcaaaagcag 


960 


30 


gacgacggcc 


ggggtctggt 


gcttcctggc 


tacaagtacc 


tcggaccctt 


caacggactc 


1020 




gacaaggggg 


agcccgtcaa 


cgcggcggac 


gcagcggccc 


tcgagcacga 


caaggcctac 


1080 




gaccagcagc 


tcaaagcggg 


tgacaatccg 


tacctgcggt 


ataaccacgc 


cgacgccgag 


1140 


35 


tttcaggagc 


gtctgcaaga 


agatacgtct 


tttgggggca 


acctcgggcg 


agcagtcttc 


1200 




caggccaaga 


agcgggttct 


cgaacctctc 


ggtctggttg 


aggaaggcgc 


taagacggct 


1260 




cctggaaaga 


agagaccggt 


agagccatca 


ccccagcgtt 


ctccagactc 


ctctacgggc 


1320 


40 


atcggcaaga 


aaggccagca 


gcccgcgaaa 


aagagactca 


actttgggca 


gactggcgac 


1380 




tcagagtcag 


tgcccgaccc 


tcaaccaatc 


ggagaacccc 


ccgcaggccc 


ctctggtctg 


1440 




ggatctggta 


caatggctgc 


aggcggtggc 


gctccaatgg 


cagacaataa 


cgaaggcgcc 


1500 


45 


gacggagtgg 


gtagttcctc 


aggaaattgg 


cattgcgatt 


ccacatggct 


gggcgacaga 


1560 




gtcatcacca 


ccagcacccg 


aacctgggcc 


ctccccacct 


acaacaacca 


cctctacaag 


1620 




caaatctcca 


acgggacttc 


gggaggaagc 


accaacgaca 


acacctactt 


cggctacagc 


1680 


50 


accccctggg 


ggtattttga 


ctttaacaga 


ttccactgcc 


acttctcacc 


acgtgactgg 


1740 




cagcgactca 


tcaacaacaa 


ctggggattc 


cggcccaaga 


gactcaactt 


caagctcttc 


1800 


55 


aacatccagg 


tcaaggaggt 


cacgcagaat 


gaaggcacca 


agaccatcgc 


caataacctt 


1860 


accagcacga 


ttcaggtctt 


tacggactcg 


gaataccagc 


tcccgtacgt 


cctcggctct 


1920 
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gcgcaccagg gctgcctgcc tccgttcccg gcggacgtct tcatgattcc tcagtacggg 1980 
tacctgactc tgaacaatgg cagtcaggcc gtgggccgtt cctccttcta ctgcctggag 2040 
tactttcctt ctcaaatgct gagaacgggc aacaactttg agttcagcta ccagtttgag 2100 
gacgtgcctt ttcacagcag ctacgcgcac agccaaagcc tggaccggct gatgaacccc 2160 
ctcatcgacc agtacctgta ctacctgtct cggactcagt ccacgggagg taccgcagga 2220 
actcagcagt tgctattttc tcaggccggg cctaataaca tgtcggctca ggccaaaaac 2280 
tggctacccg ggccctgcta ccggcagcaa cgcgtctcca cgacactgtc gcaaaataac 2340 

aacagcaact gtaaatcccg gtgtcgctat ggcaacccac aaggacgacg aagagcgatt 2400 

ttgcctggac cggtgccacc aagtatcatc tgaatggcag agactctctg ttttccgtcc 24 60 

agcggagtct taatgtttgg gaaacaggga gctggaaaag acaacgtgga ctatagcagc 2520 

gttatgctaa ccagtgagga agaaattaaa accaccaacc cagtggccac ggaacagtac 2580 

ggcgtggtgg ccgataacct gcaacagcaa aacgccgctc ctattgtagg ggccgtcaac 2640 

agtcaaggag ccttacctgg catggtctgg cagaaccggg acgtgtacct gcagggtcct 2700 

atctgggcca agattcctca cacggacgga aactttcatc cctcgccgct gatgggaggc 2760 

tttggactga aacacccgcc tcctcagatc ctgattaaga atacacctgt tcccgcggat 2820 

cctccaacta ccttcagtca agctaagctg gcgtcgttca tcacgcagta cagcaccgga 2880 

caggtcagcg tggaaattga atgggagctg cagaaagaaa acagcaaacg ctggaaccca 2S4 0 

gagattcaat acacttccaa ctactacaaa tctacaaatg tggacttcgc tgttaacaca 3000 

gatggcactt attctgagcc tcgccccatt ggcacccgtt acctcacccg taatctgtaa 3060 

ttgctcgtta atcaataaac cggttgattc gtttcagttg aactttggtc tctgcgaagg 3120 



gcgaattc 



<210> 47 

<211> 3128 

40 <212> DNA 

<213> new AAV serotype, clone 44.5 

<400> 47 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 



3128 



55 



60 

gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 120 

ccgccaaggc cattctcggc ggcagcaaag tgcgcgtgga ccaaaagtgc aagtcgtccg 180 

cccagatcga ccccaccccc gtgatcgtca cctccaacac caacatgtgc gccgtgattg 24 0 

acgggaacag caccaccttc gagcaccagc agccgttgca ggaccggatg ttcaagtttg 300 

aactcacccg ccgtctggag cacgactttg gcaaggtgac aaagcaggaa gtcagagagt 360 

tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca cgagttctac gtcagaaagg 420 

gtggagccaa caagagaccc gcccccgatg acgcggataa aagcgagccc aagcgggcct 480 
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gcccctcagt 


cgcggatcca 


tcgacgtcag 


acgcggaagg 


agctccggtg 


gactttgccg 


540 


acaggtacca 


aaacaaatgt 


tctcgtcacg 


cgggcatgct 


tcagatgctg 


tttccctgca 


600 


aaacatgcga 


gagaatgaat 


cagaatttca 


acatttgctt 


cacgcacggg 


accagagact 


660 


gttcagaatg 


tttccccggc 


gtgtcagaat 


ctcaaccggt 


tgtcagaaaa 


aagacgtatc 


720 


ggaaactctg 


tgcgattcat 


catctgctgg 


ggcgggcacc 


cgagattgct 


tgctcggcct 


780 


gcgatctggt 


caacgtggac 


ctagatgact 


gtgtttctga 


gcaataaatg 


acttaaacca 


840 


ggtatggctg 


ccgatggtta 


tcttccagat 


tggctcgagg 


acaacctctc 


tgagggcatt 


900 


cgcgagtggt 


gggacttgaa 


acctggagcc 


ccgaaaccca 


aagccaacca 


gcaaaagcag 


960 


gacgacggcc 


ggggtctggt 


gcttcctggc 


tacaagtacc 


tcggaccctt 


caacggactc 


1020 


gacaaggggg 


agcccgtcaa 


cgcggcggac 


gcagcggccc 


tcgagcacga 


caaggcctac 


1080 


gaccagcagc 


tcaaagcggg 


tgacaatccg 


tacctgcggt 


ataaccacgc 


cgacgccgag 


1140 


tttcaggagc 


gtctgcaaga 


agatacgtct 


tttgggggca 


acctcgggcg 


agcagtcttc 


1200 


caggccaaga 


agcgggttct 


cgaacctctc 


ggtctggttg 


aggaaggcgc 


taagacggct 


1260 


cctggaaaga 


agagaccggt 


agagccatca 


ccccagcgtt 


ctccagactc 


ctctacgggc 


1320 


atcggcaaga 


aaggccagca 


gcccgcgaaa 


aagagactca 


actttgggca 


gactggcgac 


1380 


tcagagtcag 


tgcccgaccc 


tcaaccaatc 


ggagaacccc 


ccgcaggccc 


ctctggtctg 


1440 


ggatctggta 


caatggctgc 


aggcggtggc 


gctccaatgg 


cagacaataa 


cgaaggcgcc 


1500 


gacggagtgg 


gtagttcctc 


aggaaattgg 


cattgcgatt 


ccacatggct 


gggcgacaga 


1560 


gtcatcacca 


ccagcacccg 


aacctgggcc 


ctccccacct 


acaacaacca 


cctctacaag 


1620 


caaatctcca 


acgggacttc 


gggaggaagc 


accaacgaca 


acacctactt 


cggctacagc 


1680 


accccctggg 


ggtattttga 


ctttaacaga 


ttccactgcc 


acttctcacc 


acgtgactgg 


1740 


cagcgactca 


tcaacaacaa 


ctggggattc 


cggcccaaga 


gacccaactt 


caagctcttc 


1800 


aacatccagg 


tcaaggaggt 


cacgcagaat 


gaaggcacca 


agaccatcgc 


caataacctt 


1860 


accagcacga 


ttcaggtctt 


tacggactcg 


gaataccagc 


tcccgtacgt 


cctcggctct 


1920 


gcgcaccagg 


gctgcctgcc 


tccgttcccg 


gcggacgtct 


tcatgattcc 


tcagtacggg 


1980 


tacctgactc 


tgaacaatgg 


cagtcaggcc 


gtgggccgtt 


cctccttcta 


ctgcctggag 


2040 


tactttcctt 


ctcaaatgct 


gagaacgggc 


aacaactttg 


agttcagcta 


ccagtttgag 


2100 


gacgtgcctt 


ttcacagcag 


ctacgcgcac 


agccaaagcc 


tggaccggct 


gatgaacccc 


2160 


ctcatcgacc 


agtacctgta 


ctacctgtct 


cggactcagt 


ccacgggagg 


taccgcagga 


2220 


actcagcagt 


tgctattttc 


tcaggccggg 


cctaataaca 


tgtcggctca 


ggccaaaaac 


2280 


tggctacccg 


ggccctgcta 


ccggcagcaa 


cgcgtctcca 


cgacactgtc 


gcaaaataac 


2340 


aacagcaact 


ttgcctggac 


cggtgccacc 


aagtatcatc 


tgaatggcag 


agactctctg 


2400 



55 
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15 



gtaaatcccg 


gtgtcgctat 


ggcaacccac 


aaggacgacg 


aagagcgatt 


ttttccgtcc 


2460 


agcggagtct 


taatgtttgg 


gaaacaggga 


gctggaaaag 


acaacgtgga 


ctatagcagc 


2520 


gttatgctaa 


ccagtgagga 


agaaattaaa 


accaccaacc 


cagtggccac 


agaacagtac 


2580 


ggcgtggtgg 


ccgataacct 


gcaacagcaa 


aacgccgctc 


ctattgtagg 


ggccgtcaac 


2640 


agtcaaggag 


ccttacctgg 


catggtctgg 


cagaaccggg 


acgtgtacct 


gcagggtcct 


2700 


atctgggcca 


agattcctca 


cacggacgga 


aactttcatc 


cctcgccgct 


gatgggaggc 


2760 


tttggactga 


aacacccgcc 


tcctcagatc 


ctgattaaga 


atacacctgt 


tcccgcggat 


2820 


cctccaacta 


ccttcagtca 


agctaagctg 


gcgtcgttca 


tcacgcagta 


cagcaccgga 


2880 


caggtcagcg 


tggaaattga 


atgggagctg 


cagaaagaaa 


acagcaaacg 


ctggaaccca 


2940 


gagattcaat 


acacttccaa 


ctactacaaa 


tctacaaatg 


tggactttgc 


tgttaacaca 


3000 


gatggcactt 


attctgagcc 


tcgccccatt 


ggcacccgtt 


acctcacccg 


taatctgtaa 


3060 


ttgcttgtta 


atcaataaac 


cggttgattc 


gtttcagttg 


aactttggtc 


tctgcgaagg 


3120 


gcgaattc 












3128 



<210> 48 

<211> 1933 

<212> DNA 

<213> new AAV serotype, clone 223.10 
<220> 

<221> inisc_f eature 

<222> (1302) . . (1302) 



<223> can 


be a, c, g 


or t 










<400> 48 
caaggcctac 


gaccagcagc 


tcaaagcggg 


tgacaatccg 


tacctgcggt 


ataaccacgc 


60 


cgacgccgag 


tttcaggagc 


gtcttcaaga 


agatacgtct 


tttgggggca 


acctcgggcg 


120 


agcagtcttc 


caggccaaaa 


agcgggttct 


cgaacctctt 


ggtctggttg 


agacgccagc 


180 


taagacggca 


cctggaaaga 


agcgaccggt 


agactcgcca 


gactccacct 


cgggcatcgg 


240 


caagaaaggc 


cagcagcccg 


cgaaaaagag 


actcaacttt 


gggcagactg 


gcgactcaga 


300 


gtcagtcccc 


gaccctcaac 


caatcggaga 


accaccagca 


ggcccctctg 


gtctgggatc 


360 


tggtacaatg 


gctgcaggcg 


gtggcgcacc 


aatggctgac 


aataacgagg 


gcgccgacgg 


420 


agtgggtaat 


gcctcaggaa 


attggcattg 


cgattccaca 


tggctgggcg 


acagagtcat 


480 


caccaccagc 


acccgaacct 


gggccctgcc 


cacctacaac 


aaccacctct 


acaagcaaat 


540 


ctccagtcag 


tcagcaggga 


gcaccaacga 


taacgtctat 


ttcggctaca 


gcaccccctg 


600 


ggggtatttt 


gacttcaaca 


gattccattg 


ccacttctca 


ccacgtgact 


ggcagcgact 


660 


tatcaacaac 


aactggggat 


tccggcccaa 


gaagctcaac 


ttcaagctct 


tcaacatcca 


720 


ggtcaaggag 


gtcacgacga 


atgacggtgt 


cacaaccatc 


gctaataacc 


ttaccagcac 


780 
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30 



ggttcaggtc 


ttttcggact 


cggaatatca 


actgccgtac 


gtcctcggct 


ccgcgcacca 


840 


gggctgcctg 


cctccgttcc 


cggcagacgt 


gttcatgatt 


ccgcagtacg 


gatacctgac 


900 


tctgaacaat 


ggcagccaat 


cggtaggccg 


ttcctccttc 


tactgcctgg 


agtactttcc 


960 


ttctcagatg 


ctgagaacgg 


gcaacaactt 


cacctttagc 


tacaccttcg 


aggacgtgcc 


1020 


tttccacagc 


agctacgcgc 


acagccagag 


tctggaccgg 


ctgatgaatc 


ccctcatcga 


1080 


ccagtacctg 


tactacttgg 


ccagaacaca 


gagcaacgca 


ggaggtactg 


ctggcaatcg 


1140 


ggaactgcag 


ttttatcagg 


gcggacctac 


caccatggcc 


gaacaagcaa 


agaactggct 


1200 


gcccggacct 


tgcttccggc 


aacagagagt 


atccaagacg 


ctggatcaaa 


ataacaacag 


1260 


caactttgcc 


tggactggtg 


ccacaaaata 


ccatttaaat 


gnaagaaatt 


cattggttaa 


1320 


tcccggtgtc 


gccatggcaa 


cccacaagga 


cgacgaggaa 


cgcttcttcc 


cttcgagcgg 


1380 


agttctaatt 


tttggcaaaa 


ctggagcagc 


taataaaact 


acattagaaa 


acgtgctcat 


1440 


gacaaatgaa 


gaagaaattc 


gtcctaccaa 


cccggtagct 


accgaggaat 


acgggattgt 


1500 


aagcagcaac 


ttgcaggcgg 


ctagcaccgc 


agcccagaca 


caagttgtta 


acaaccaggg 


1560 


agccttacct 


ggcatggtct 


ggcagaaccg 


ggacgtgtac 


ctgcaaggtc 


ccatttgggc 


1620 


caagattcct 


cacacggacg 


gcaactttca 


cccgtctcct 


ctaatgggtg 


gctttggact 


1680 


gaaacacccg 


cctccccaga 


tcctgatcaa 


aaacacaccg 


gtacctgcta 


atcctccaga 


1740 


agtgtttact 


cctgccaagt 


ttgcttcctt 


catcacgcag 


tacagcaccg 


ggcaagtcag 


1800 


cgttgagatc 


gagtgggagc 


tgcagaaaga 


gaacagcaag 


cgctggaacc 


cagagattca 


1860 


gtacacctcc 


aactttgaca 


aacagactgg 


agtggacttt 


gctgttgaca 


gccagggtgt 


1920 


ttactctgag 


cct 










1933 



35 

<210> 49 

<211> 1933 

<212> DNA 

<213> new AAV serotype, clone 223.2 

40 

<400> 49 





caaggcctac 


gaccagcagc 


tcaaagcggg 


tgacaatccg 


tacctgcggt 


ataaccacgc 


60 




cgacgccgag 


tttcaggagt 


gtcttcaaga 


agatacgtct 


tttgggggca 


acctcgggcg 


120 


45 


agcagtcttc 


caggccaaaa 


agcgggttct 


cgaacctctt 


ggtctggttg 


agacgccagc 


180 




taagacggca 


cctggaaaga 


agcgaccggt 


agactcgcca 


gactccacc t 


cgggcatcgg 


240 




caagaaaggc 


cagcagcccg 


cgaaaaagag 


actcaacttt 


gggcagactg 


gcgactcaga 


300 


50 


gtcagtcccc 


gaccctcaac 


caatcggaga 


accaccagca 


ggcccctctg 


gtctgggatc 


360 




tggtacaatg 


gttgcaggcg 


gtggcgcacc 


aatggctgac 


aataacgagg 


gcgccgacgg 


420 




agtgggtaat 


gcctcaggaa 


attggcattg 


cgattccaca 


tggctgggcg 


acagagtcat 


480 


55 


caccaccagc 


acccgaacct 


gggccctgcc 


cacctacaac 


aaccacctct 


acaagcaaat 


540 
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ctccagtcag 


tcagcaggga 


gcaccaacga 


taacgtctat 


ttcggctaca 


gcaccccctg 


600 




ggggtatttt 


gacttcaaca 


gattccattg 


ccacttctca 


ccacgtgact 


ggcagcgact 


660 


5 


tatcaacaac 


aactggggat 


tccggcccaa 


gaagctcaac 


ttcaagctct 


tcaacatcca 


720 




ggtcaaggag 


gtcacgacga 


atgacggtgt 


cacaaccatc 


gctaataacc 


ttaccagcac 


780 




ggttcaggtc 


ttttcggact 


cggaatatca 


actgccgtac 


gtcctcggct 


ccgcgcacca 


840 


10 


gggctgcctg 


cctccgttcc 


cggcagacgt 


gttcatgatt 


ccgcagtacg 


gatacctgac 


900 




tctgaacaat 


ggcagccaat 


cggtaggccg 


ttcctccttc 


tactgcctgg 


agtactttcc 


960 




ttctcagatg 


ctgagaacgg 


gcaacaactt 


cacctttagc 


tacaccttcg 


aggacgtgcc 


1020 


15 


tttccacagc 


agctacgcgc 


acagccagag 


tctggaccgg 


ctgatgaatc 


ccctcatcga 


1080 




ccagtacctg 


tactacttgg 


ccagaacaca 


gagcaacgca 


ggaggtactg 


ctggcaatcg 


1140 




ggaactgcag 


ttttatcagg 


gcggacctac 


caccatggcc 


gaacaagcaa 


agaactggct 


1200 


20 


gcccggacct 


tgcttccggc 


aacagagagt 


atccaagacg 


ctggatcaaa 


ataacaacag 


1260 




caactttgcc 


tggactggtg 


ccacaaaata 


ccatttaaat 


ggaagaaatt 


cattggttaa 


1320 




tcccggtgtc 


gccatggcaa 


cccacaagga 


cgacgaggaa 


cgcttctccc 


cttcgagcgg 


1380 


25 


agttctaatt 


tttggcaaaa 


ctggagcagc 


taataaaact 


acattagaaa 


acgtgctcat 


1440 




gacaaatgaa 


gaagaaattc 


gtcctaccaa 


cccggtagct 


accgaggaat 


acgggattgt 


1500 




aagcagcaac 


ttgcaggcgg 


ctagcaccgc 


agcccagaca 


caagttgtt a 


acaaccaggg 


1560 


30 


agccttacct 


ggcatggtct 


ggcagaaccg 


ggacgtgtac 


ctgcaaggtc 


ccatttgggc 


1620 




caagattcct 


cacacggacg 


gcaactttca 


cccgtctcct 


ctaatgggtg 


gctttggact 


1680 




gaaacacccg 


cctccccaga 


tcctgat caa 


aaacacgccg 


gtacc tgct a 


atcc tccaga 


1740 


35 


agtgtttact 


cctgccaagt 


ttgcttcctt 


catcacgcag 


tacagcaccg 


ggcaagtcag 


1800 




cgttgagatc 


gagtgggagc 


tgcagaaaga 


gaacagcaag 


cgctggaacc 


cagagattca 


1860 




gtacacctcc 


aactttgaca 


aacagactgg 


agtggacttt 


gctgttgaca 


gccagggtgt 


1920 


40 


ttactctgag 


cct 










1933 



<210> 50 
<211> 1933 
<212> DNA 

<213> new AAV serotype, clone 22 3.4 

<400> 50 

caaggcctac gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataaccacgc 60 
cgacgccgag tttcaggagc gtcttcaaga agatacgtct tttgggggca acctcgggcg 120 
agcagtcttc caggccaaaa agcgggttct cgaacctctt ggtctggttg agacgccagc 180 
taagacggca cctggaaaga agcgaccggt agactcgcca gactccacct cgggcatcgg 240 
caagaaaggc cagcagcccg cgaaaaagag actcaacttt gggcagactg gcgactcaga 300 
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gccagtcccc gaccctcaac caatcggaga 
tggtacaatg gctgcaggcg gtggcgcacc 

5 

agtgggtaat gcctcaggaa attggcattg 
caccaccagc acccgaacct gggccctgcc 
ctccagtcag tcagcaggga gcaccaacga 

10 

ggggtatttt gacttcaaca gattccattg 
tatcaacaac aactggggat tccggcccaa 
ggtcaaggag gtcacgacga atgacggcgt 

15 

ggttcaggtc ttttcggact cggaatatca 
gggctgcctg cctccgttcc cggcagacgt 
tctgaacaat ggcagccaat cggtaggccg 

20 

ttctcagatg ctgagaacgg gcaacaactt 
tttccacagc agctacgcgc acagccagag 
ccagtacctg tactacttgg ccagaacaca 

25 

ggaactgcag ttttatcagg gcggacctac 
gcccggacct tgcttccggc aacagagagt 
caactttgcc tggactggtg ccacaaaata 

30 

tcccggtgtc gccatggcaa cccacaagga 
agttctaatt tttggcaaaa ctggagcagc 
gacaaatgaa gaagaaattc gtcctaccaa 

35 

aagcagcaac ttgcaggcgg ctagcaccgc 
agccttacct ggcatggtct ggcagaaccg 
caagattcct cacacggacg gcaactttca 

40 

gaaacacccg cctccccaga tcctgatcaa 
agtgtttact cctgccaagt ttgcttcctt 
cgttgagatc gaatgggagc tgcagaaaga 

45 

gtacacctcc aactttgaca aacagactgg 
ttactctgag cct 

50 <210> 51 

<211> 1933 
<212> DNA 

<213> new AAV serotype, clone 2 
<400> 51 

55 caaggcctac gaccagcagc tcaaagcggg 



accaccagca ggcccctctg gtctgggatc 360 

aatggctgac aataacgagg gcgccgacgg 420 

cgattccaca cggctgggcg acagagtcat 480 

cacctacaac aaccacctct acaagcaaat 540 

taacgtctat ttcggctaca gcaccccctg 600 

ccacttctca ccacgtgact ggcagcgact 660 

gaagctcaac ttcaagctct tcaacatcca 720 

cacaaccatc gctaataacc ttaccagcac 780 

actgccgtac gtcctcggct ccgcgcacca 840 

gttcatgatt ccgcagtacg gatacctgac 900 

ttcctccttc tactgcctgg agtactttcc 960 

cacctttagc tacaccttcg aggacgtgcc 1020 

tctgggccgg ctgatgaatc ccctcatcga 1080 

gagcaacgca ggaggtactg ctggcaatcg 1140 

caccatggcc gaacaagcaa agaactggct 1200 

atccaagacg ctggatcaaa ataacaacag 1260 

ccatttaaat ggaagaaatt cattggttaa 1320 

cgacgaggaa cgcttcttcc cttcgagcgg 1380 

taataaaact acattagaaa acgtgctcat 1440 

cccggtagct accgaggaat acgggattgt 1500 

agcccagaca caagttgtta acaaccaggg 1560 

ggacgtgtac ctgcaaggtc ccatttgggc 162 0 

cccgtctcct ctaatgggtg gctttggact 1680 

aaacacaccg gtacctgcta atcctccaga 1740 

catcacgcag tacagcaccg ggcaagtcag 1800 

gaacagcaag cgctggaacc cagagattca 18 60 

agtggacttt gctgttgaca gccagggtgt 1920 

1933 

>3.5 

tgacaatccg tacctgcggt ataaccacgc 60 
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25 



30 



35 



40 



45 



50 



cgacgccgag 


tttcaggagc 


gtcttcaaga 


agatacgtct 


tttgggggca 


acctcgggcg 


120 


agcagtcttc 


caggccaaaa 


agcgggttct 


cgaacctctt 


ggtctggttg 


agacgccagc 


180 


taagacggca 


cctggaaaga 


agcgaccggt 


agactcgcca 


gactccacct 


cgggcatcgg 


240 


caagaaaggc 


cagcagcccg 


cgaaaaagag 


actcaacttt 


gggcagactg 


gcgactcaga 


300 


gccagtcccc 


gaccctcaac 


caatcggaga 


accaccagca 


ggcccctctg 


gtctgggatc 


360 


tggtacaatg 


gctgcaggcg 


gtggcgcacc 


aatggctgac 


aataacgagg 


gcgccgacgg 


420 


agtgggtaat 


gcctcaggaa 


attggcattg 


cgattccaca 


cggctgggcg 


acagagtcat 


480 


caccaccagc 


acccgaacct 


gggccctgcc 


cacctacaac 


aaccacctct 


acaagcaaat 


540 


ctccagtcag 


tcagcaggga 


gcaccaacga 


taacgtctat 


ttcggctaca 


gcaccccctg 


600 


ggggtatttt 


gacttcaaca 


gattccattg 


ccacttct ca 


ccacgtgact 


ggcagcgact 


660 


t atcaacaac 


aactggggat 


tccggcccaa 


gaagct caac 


ttcaagctct 


tcaacatcca 


720 


ggtcaaggag 


gtcacgacga 


atgacggcgt 


cacaaccatc 


gctaataacc 


ttaccagcac 


780 


ggttcaggtc 


ttttcggact 


cggaatatca 


actgccgtac 


gtcctcggct 


ccgcgcacca 


840 


gggctgcctg 


cctccgttcc 


cggcagacgt 


gttcatgatt 


ccgcagtacg 


gatacctgac 


900 


tctgaacaat 


ggcagccaat 


cggtaggccg 


ttcctccttc 


tactgcctgg 


agtactttcc 


960 


ttctcagatg 


ctgagaacgg 


gcaacaactt 


cacctttagc 


tacaccttcg 


aggacgtgcc 


1020 


tttccacagc 


agctacgcgc 


acagccagag 


tctgggccgg 


ctgatgaatc 


ccctcatcga 


1080 


ccagtacctg 


tactacttgg 


ccagaacaca 


gagcaacgca 


ggaggtactg 


ctggcaatcg 


1140 


ggaactgcag 


ttttatcagg 


gcggacctac 


caccatggcc 


gaacaagcaa 


agaactggct 


1200 


gcccggacct 


tgcttccggc 


aacagagagt 


atccaagacg 


ctggatcaaa 


ataacaacag 


1260 


caactttgcc 


tggactggtg 


ccacaaaata 


ccatttaaat 


ggaagaaatt 


cattggttaa 


1320 


tcccggtgtc 


gccatggcaa 


cccacaagga 


cgacgaggaa 


cgcttcttcc 


cttcgagcgg 


1380 


agttctaatt 


tttggcaaaa 


ctggagcagc 


taataaaact 


acattagaaa 


acgtgctcat 


1440 


gacaaatgaa 


gaagaaattc 


gtcctaccaa 


cccggtagct 


accgaggaat 


acgggattgt 


1500 


aagcagcaac 


ttgcaggcgg 


ctagcaccgc 


agcccagaca 


caagttgtta 


acaaccaggg 


1560 


agccttacct 


ggcatggtct 


ggcagaaccg 


ggacgtgtac 


ctgcaaggtc 


ccatttgggc 


1620 


caagattcct 


cacacggacg 


gcaactttca 


cccgtctcct 


ctaatgggtg 


gctttggact 


1680 


gaaacacccg 


cctccccaga 


tcctgatcaa 


aaacacaccg 


gtacctgcta 


atcctccaga 


1740 


agtgtttact 


cctgccaagt 


ttgcttcctt 


catcacgcag 


tacagcaccg 


ggcaagtcag 


1800 


cgttgagatc 


gaatgggagc 


tgcagaaaga 


gaacagcaag 


cgctggaacc 


cagagattca 


1860 


gtacacctcc 


aactttgaca 


aacagactgg 


agtggacttt 


gctgttgaca 


gccagggtgt 


1920 


ttactctgag 


cct 










1933 



55 
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<210> 52 
<211> 1933 
<212> DNA 
<213> new 


AAV serotype, clone 22 3.6 








<400> 52 
caaggcctac 


gaccagcagc 


tcaaagcggg 


tgacaatccg 


tacctgcggt 


ataaccacgc 


60 


cgacgccgag 


tttcaggagc 


gtcttcaaga 


agatacgtct 


tttgggggca 


acctcgggcg 


120 


agcagtcttc 


caggccaaaa 


agcgggttct: 


cgaacctctt 


ggtctggttg 


agacgccagc 


180 


taagacggca 


cctggaaaga 


agcgaccggt 


agactcgcca 


gactccacct 


cgggcatcgg 


240 


caagaaaggc 


cagcagcccg 


cgaaaaagag 


actcaacttt 


gggcagactg 


gcgactcaga 


300 


gtcagtcccc 


gaccctcaac 


caatcggaga 


accaccagca 


ggcccctctg 


gtctgggatc 


360 


tggtacaatg 


gc tgcaggcg 


gtggcgcacc 


aatggctgac 


aatagcgagg 


gcgccgacgg 


420 


agtgggtaat 


gcctcaggaa 


attggcattg 


cgattccaca 


tggctgggcg 


acagagtcat 


480 


caccaccagc 


acccgaacct 


gggccct gcc 


cacc tacaac 


aaccacctct 


acaagcaaat 


540 


ctccagtcag 


tcagcaggga 


gcaccaacga 


taacgtctat 


ttcggctaca 


gcaccccctg 


600 


ggggtatttt 


gacttcaaca 


gattccattg 


ccacttctca 


ccacgtgact 


ggcagcgact 


660 


tatcaacaac 


aactggggat 


tccggcccaa 


gaagct caac 


ttcaagctct 


tcaacatcca 


720 


ggtcaaggag 


gtcacgacga 


atqacggtgt 


cacaaccatc 


gctaataacc 


ttaccagcac 


780 


ggttcaggtc 


ttttcggact 


cggaatatca 


actgccgtac 


gtcctcggct 


ccgcgcacca 


840 


gggctgcctg 


cctccgttcc 


cggcagacgt 


gttcatgatt 


ccgcagtacg 


gatacctgac 


900 


tctgaacaat 


ggcagccaat 


cggtaggccg 


ttcctccttc 


tactgcctgg 


agtactttcc 


960 


ttctcagatg 


ctgagaacgg 


gcaacaactt 


cacctttagc 


tacaccttcg 


aggacgtgcc 


1020 


tttccacagc 


agctacgcgc 


acagccagag 


tctggaccgg 


ctgatgaatc 


ccctcatcga 


1080 


ccagtacctg 


tactacttgg 


ccagaacaca 


gagcaacgca 


ggaggtactg 


ctggcaatcg 


1140 


ggaactgcag 


ttttatcagg 


gcggacctac 


caccatggcc 


gaacaagcaa 


agaactggct 


1200 


gcccggacct 


tgcttccggc 


aacagagagt 


atccaagacg 


ctggatcaaa 


ataacaacag 


1260 


caactttgcc 


tggactggtg 


ccacaaaata 


ccatttaaat 


ggaagaaatt 


cattggttaa 


1320 


tcccggtgtc 


gccatggcaa 


cccacaagga 


cgacgaggaa 


cgcttcttcc 


cttcgagcgg 


1380 


agttctaatt 


tttggcaaaa 


ctggagcagc 


taataaaact 


acattagaaa 


acgtgctcat 


1440 


gacaaatgaa 


gaagaaattc 


gtcctaccaa 


cccggtagct 


accgaggaat 


acgggattgt 


1500 


aagcagcaac 


ttgcaggcgg 


ctagcaccgc 


agcccagaca 


caagttgtta 


acaaccaggg 


1560 


agccttacct 


ggcatggtct 


ggcagaaccg 


ggacgtgtac 


ctgcaaggtc 


ccatttgggc 


1620 


caagattcct 


cacacggacg 


gcaactttca 


cccgtctcct 


ctaatgggtg 


gctttggact 


1680 


gaaacacccg 


cctccccaga 


tcctgatcaa 


aaacacaccg 


gtacctgcta 


atcctccaga 


1740 



55 
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agtgtttact cctgccaagc ttgcttcctt catcacgcag tacagcaccg ggcaagtcag 1800 

cgttgagatc gagtgggagc tgcagaaaga gaacagcaag cgctggaacc cagagattca 1860 

gtacacctcc aactttgaca aacagactgg agtggacttt gctgttgaca gccagggtgt 1920 

ttactctgag cct 1933 
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<210> 53 
<211> 1933 
<212> DNA 

<213> new AAV serotype, clone 223.7 
<400> 53 

caaggcctac gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataaccacgc 60 

cgacgccgag tttcaggagc gtcttcaaga agatacgtct tttgggggca acctcgggcg 120 

agcagtcttc caggccaaaa agcgggttct cgaacctctt ggtctggttg agacgccagc 180 

taagacggca cctggaaaga agcgaccggt agactcgcca gactccacct cgggcatcgg 240 

caagaaaggc cagcagcccg cgaaaaagag actcaacttt gggcagactg gcgactcaga 300 

gtcagtcccc gaccctcaac caatcggaga accaccagca ggcccctctg gtctgggatc 3S0 

tggtacaatg gctgcaggcg gtggcgcacc aatggctgac aataacgagg gcgccgacgg 420 

agtgggtaat gcctcaggaa attggcattg cgattccaca tggctgggcg acagagtcat 480 

caccaccagc acccgaacct gggccctgcc cacctacaac aaccacctct acaagcaaat 540 

ctccagtcag tcagcaggga gcaccaacga taacgtctat ttcggctaca gcaccccctg 600 

ggggtatttt gacttcaaca gattccattg ccacttctca ccacgtgact ggcagcgact 660 

tatcaacaac aactggggat tccggcccaa gaagctcaac ttcaagctct tcaacatcca 720 

ggtcaaggag gtcacgacga atgacggcgt cacaaccatc gctaataacc ttaccagcac 780 

ggttcaggtc ttttcggacc cggaatatca actgccgtac gtcctcggct ccgcgcacca 840 

gggctgcctg cctccgttcc cggcagacgt gttcatgatt ccgcagtacg gatacctgac 900 

tctgaacaat ggcagccaat cggtaggccg ttcctccttc tactgcctgg agtactttcc 960 

ttctcagatg ctgagaacgg gcaacaactt cacctttagc tacaccttcg aggacgtgcc 1020 

tttccacagc agctacgcgc acagccagag tctggaccgg ctgatgaatc ccctcatcga 1080 

ccagtacctg tactacttgg ccagaacaca gagcaacgca ggaggtactg ctggcaatcg 1140 

ggaactgcag ttttatcagg gcggacctac caccatggcc gaacaagcaa agaactggct 1200 

gcccggacct tgcttccggc aacagagagt atccaagacg ctggatcaaa ataacaacag 1260 

caactttgcc tggactggtg ccacaaaata ccatttaaat ggaagaaatt cattggttaa 1320 

tcccggtgtc gccatggcaa cccacaagga cgacgaggaa cgcttcttcc cttcgagcgg 1380 

agttctaatt tttggcaaaa ctggagcagc taataaaact acattagaaa acgtgctcat 1440 

gacaaatgaa gaagaaattc gtcctaccaa cccggtagct accgaggaat acgggattgt 1500 
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aagcagcaac 


ttgcaggcgg 


ctagcaccgc 


agcccagaca 


caagttgtta 


acaaccaggg 


1560 




agccttacct 


ggcatggtct 


ggcagaaccg 


ggacgtgtac 


ctgcaaggtc 


ccatttgggc 


1620 


5 


caagattcct 


cacacggacg 


gcaactttca 


cccgtctcct 


ctaatgggtg 


gctttggact 


1680 




gaaacacccg 


cctccccaga 


tcctgatcaa 


aaacacaccg 


gtacctgcta 


atcctccaga 


1740 


10 


agtgtttact 


cctgccaaga 


ttgcttcctt 


catcacgcag 


tacagcaccg 


ggcaagtcag 


1800 


cgttgagatc 


gagtgggagc 


tgcagaaaga 


gaacagcaag 


cgctggaacc 


cagagattca 


1860 




gtacacctcc 


aactttgaca 


aacagactgg 


agtggacttt 


gctgttgaca 


gccagggtgt 


1920 


15 


ttactctgag 


cct 










1933 


20 


<210> 54 
<211> 3123 
<212> DNA 

<213> new AAV serotype, clone A3 . 4 








<400> 54 
gaattcgccc 


tttctacggc 


tgcgtcaact 


ggaccaatga 


aaactttccc 


ttcaacgatt 


60 




gcgtcgacaa 


gatggtgatc 


tggtgggagg 


agggaaagat 


gaccgccaag 


gtcgtggaat 


120 


25 


ctgccaaagc 


cattctgggt 


ggaagcaagg 


ttcgtgtgga 


ccagaaatgc 


aagtcttcgg 


180 




cccagatcga 


cccgactccg 


gtgattgtca 


cctctaacac 


caacatgtgc 


gccgtgattg 


240 




acggaaactc 


gaccaccttc 


gagcaccagc 


agccgttgca 


agaccggatg 


ttcaaatttg 


300 


30 


aacttacccg 


ccgtttggat 


catgactttg 


ggaaggtcac 


caagcaggaa 


gtcaaagact 


360 




ttttccggtg 


ggctcaagat 


cacgtgactg 


aggtggagca 


tgagttctac 


gtcaaaaagg 


420 




gtggagccaa 


gaaaaggccc 


gcccccgatg 


atgtatatat 


aaatgagccc 


aagcgggcgc 


480 


35 


gcgagtcagt 


tgcgcagcca 


tcgacgtcag 


acgcggaagc 


ttcgataaac 


tacgcgggca 


540 




ggtaccaaaa 


caaatgttct 


cgtcacgtgg 


gcatgaatct 


gatgctgttt 


ccctgtcgac 


600 




aatgcgaaag 


aatgaatcag 


aattcaaata 


tctgcttcac 


acacgggcaa 


aaagactgtt 


660 


40 


tggaatgctt 


tcccgtgtca 


gaatctcaac 


ccgtttctgt 


cgtcagaaaa 


acgtatcaga 


720 




aactttgtta 


cattcatcat 


atcatgggaa 


aagaaccaga 


cgcctgcact 


gcctgcgacc 


780 




tggtaaatgt 


ggacttggat 


gactgtattt 


ctgagcaata 


aatgacttaa 


atcaggtatg 


840 


45 


gctgctgacg 


gttatcttcc 


agattggctc 


gaggacactc 


tctctgaagg 


aatcagacag 


900 




tggtggaagc 


tcaaacctgg 


cccaccaccg 


ccgaaaccta 


accaacaaca 


ccgggacgac 


960 




agtaggggtc 


ttgtgcttcc 


tgggtacaag 


tacctcggac 


rT't'+T'aar'rTrr 


CL w ^ w U d CL CLCL 


1020 


50 


ggagagccgg 


tcaacgaggc 


agacgccgcg 


gccctcgagc 


acgacaaagc 


ctacgaccac 


1080 




cagctcaagc 


aaggggacaa 


cccgtacctc 


aaatacaacc 


acgcggacgc 


tgaatttcag 


1140 




gagcgtcttc 


aagaagatac 


gtctttcggg 


ggcaacctcg 


ggcgagcagt 


cttccaggcc 


1200 


55 


aaaaagaggg 


tactcgagcc 


tcttggtctg 


gttgaggaag 


ctgttaagac 


ggctcctgga 


1260 
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aaaaagagac 


ctatagagca 


gtctcctgca 


gaaccggact 


cttcctcggg 


catcggcgaa 


1320 


tcaggccagc 


agcccgctaa 


gaaaagactc 


aattttggtc 


agactggcga 


cacagagtca 


1380 


gtcccagacc 


ctcaaccaat 


cggagaaccc 


cccgcagccc 


cctctggtgt 


gggatctaat 


1440 


acaatggctt 


caggcggtgg 


ggcaccaatg 


gcagacgata 


acgaaggcgc 


cgacggagtg 


1500 


ggtaattcct 


cgggaaattg 


gcattgcgat 


tccacatgga 


tgggcgacag 


agttatcacc 


1560 


accagcacaa 


gaacctgggc 


cctccccacc 


tacaataatc 


acctctacaa 


gcaaatctcc 


1620 


agcgaatcgg 


gagccaccaa 


cgacaaccac 


tacttcggct 


acagcacccc 


ctgggggtat 


1680 


tttgacttta 


acagattcca 


ctgtcacttc 


tcaccacgtg 


actggcagcg 


actcatcaac 


1740 


aacaactggg 


gatttagacc 


caagaaactc 


aatttcaagc 


tcttcaacat 


ccaagtcaag 


1800 


gaggtcacgc 


agaatgat gg 


aaccacgacc 


atcgccaata 


accttaccag 


cacggtgcag 


1860 


gtcttcacag 


actctgagta 


ccagctgccc 


tacgtcctcg 


gttcggctca 


ccagggctgc 


1920 


cttccgccgt 


tcccagcaga 


cgtcttcatg 


attcctcagt 


acggctactt 


gactctgaac 


1980 


aat ggcagcc 


aagcggtagg 


acgttcttca 


ttctactgtc 


tagagtatt t 


tccctctcag 


2040 


atgctgagga 


cgggaaacaa 


cttcaccttc 


agctacactt 


ttgaagacgt 


gcctttccac 


2100 


agcagctacg 


cgcacagcca 


gagtctggat 


cggctgatga 


atcctctcat 


tgaccagtac 


2160 


ctgtattacc 


tgagcaaaac 


tcagggtaca 


agtggaacaa 


cgcagcaatc 


gagactgcag 


2220 


ttcagccaag 


ctgggcctag 


ctccatggct 


cagcaggcca 


aaaactggct 


accgggaccc 


2280 


agctaccgac 


agcagcgaat 


gtctaagacg 


gctaatgaca 


acaacaacag 


tgaatttgct 


2340 


tggactgcag 


ccaccaaata 


ttacctgaat 


ggaagaaatt 


ctctggtcaa 


tcccgggccc 


2400 


ccaatggcca 


gtcacaagga 


cgatgaggaa 


aagtatttcc 


ccatgcacgg 


aaatctcatc 


2460 


tttggaaaac 


aaggcacagg 


aactaccaat 


gtggacattg 


aatcagtgct 


tattacagac 


2520 


gaagaagaaa 


tcagaacaac 


taatcctgtg 


gctacagaac 


aatacggaca 


ggttgccacc 


2580 


aaccatcaga 


gtcaggacac 


cacagcttcc 


tatggaagtg 


tggacagcca 


gggaatctta 


2640 


cctggaatgg 


tgtggcagga 


ccgcgatgtc 


tatcttcaag 


gtcccatttg 


ggccaaaact 


2700 


cctcacacgg 


acggacactt 


tcatccttct 


ccgctcatgg 


gaggctttgg 


actgaaacac 


2760 


cctcctcccc 


agatcctgat 


caaaaacaca 


cctgtgccag 


cgaatcccgc 


gaccactttc 


2820 


actcctggaa 


agtttgcttc 


gttcattacc 


cagtattcca 


ccggacaggt 


cagcgtggaa 


2880 


a t* a rra nt* ctcict 
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tccaactaca 


acaagtcggt 


gaatgtggag 


tttaccgtgg 


acgcaaacgg 


tgtttattct 


3000 


gaaccccgcc 


ctattggcac 


tcgttacctt 


acccggaact 


tgtaatttcc 


tgttaatgaa 


3060 


taaaccgatt 


tatgcgtttc 


agttgaactt 


tggtctctgc 


gaagggcgaa 


ttcgcggccg 


3120 


eta 












3123 



55 
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<210> 55 

<211> 3113 

<212> DNA 

<213> new AAV serotype, clone A3* 5 

<400> 55 



55 



gaattcgccc 


tttctacggc 


tgcgtcaact 


ggaccaatga 


aaactttccc 


ttcaacgatt 


60 


gcgtcgacaa 


gatggtgatc 


tggtgggagg 


agggaaagat 


gaccgccaag 


gtcgtggaat 


120 


ctgccaaagc 


cattctgggt 


ggaagcaagg 


ttcgtgtgga 


ccagaaatgc 


aagtcttcgg 


180 


cccagatcga 


cccgactccg 


gtgattgtca 


cctctaacac 


caacatgtgc 


gccgtgattg 


240 


acggaaactc 


gaccaccttc 


gagcac:cagc 


agccgttgca 


agaccggatg 


ttcaaatttg 


300 


aacttacccg 


ccgtttggat 


catgactttg 


ggaaggtcac 


caagcaggaa 


gtcaaagact 


360 


t tttccggtg 


ggctcaagat 


cacgtgactg 


aggtggagca 


tgagttctac 


gtcaaaaagg 


420 


gtggagccaa 


gaaaaggccc 


gcccccgatg 


atgtatatat 


aaatgagccc 


aagcgggcgc 


480 


gcgagtcagt 


tgcgcagcca 


tcgacgtcag 


acgcggaagc 


ttcgataaac 


tacgcggaca 


540 


ggtaccaaaa 


caaatgtt ct 


cgtcacgtgg 


gcatgaatct 


gatgctgttt 


ccctgtcgac 


600 


aatgcgaaag 


aatgaatcag 


aattcaaata 


tctgcttcac 


acacgggcaa 


aaagactgtt 


660 


tggaatgctt 


tcccgtgtca 


gaatctcaac 


ccgttcctgt 


cgtcagaaaa 


acgtatcaga 


720 


aactttgtta 


cattcatcat 


atcatgggaa 


aagtaccaga 


cgcctgcact 


gcctgcgacc 


780 


tggtaaatgt 


ggacttggat 


gactgtattt 


ctgagcaata 


aatgacttaa 


atcaggtatg 


840 


gctgctgacg 


gttatcttcc 


agattggctc 


gaggacactc 


tctctgaagg 


aatcagacag 


900 


tggtggaagc 


tcaaacctgg 


cccaccaccg 


ccgaaaccta 


accaacaaca 


ccgggacgac 


960 


agtaggggtc 


ttgtgcttcc 


tgggtacaag 


tacctcggac 


ccttcaacgg 


actcgacaaa 


1020 


ggagagccgg 


tcaacgaggc 


agacgccgcg 


gccctcgagc 


acgacaaagc 


ctacgaccac 


1080 


cagctcaagc 


aaggggacaa 


cccgtacctc 


aaatacaacc 


acgcggacgc 


tgaatttcag 


1140 


gagcgtcttc 


aagaagatac 


gtctttcggg 


ggcaacctcg 


ggcgagcagt 


cttccaggcc 


1200 


aaaaagaggg 


tactcgagcc 


tcttggtctg 


gttgaggaag 


ctgttaagac 


ggctcctgga 


1260 


aaaaagagac 


ctatagagca 


gtctcctgca 


gaaccggact 


ctt cctcggg 


catcggcaaa 


1320 


tcaggccagc 


agcccgctaa 


gaaaagactc 


aattttggtc 


agactggcga 


cacagagtca 


1380 


gtcccagacc 


ctcaaccaat 


cggagaaccc 


cccgcagccc 


cctctggtgt 


gggatctaat 


1440 


acaatggctt 


caggcggtgg 


ggcaccaatg 


gcagacaata 


acgaaggcgc 


cgacggagtg 


1500 


ggtaattcct 


cgggaaattg 


gcattgcgat 


tccacatgga 


tgggcgacag 


agttatcacc 


1560 


accagcacaa 


gaacctgggc 


cctccccacc 


tacaataatc 


acctctacaa 


gcaaatctcc 


1620 


agcgaatcgg 


gagccaccaa 


cgacaaccac 


tacttcggct 


acagcacccc 


ctgggggtat 


1680 


tttgacttta 


acagattcca 


ctgtcacttc 


tcaccacgtg 


actggcagcg 


actcatcaat 


1740 
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aacaactggg 


gatttagacc 


caagaaactc 


aatttcaagc 


tcttcaacat 


ccaagtcaag 


1800 




gaggtcacgc 


agaatgatgg 


aaccacgacc 


atcgccaata 


accttaccag 


cacggtgcag 


1860 


5 


gtcttcacag 


actctgagta 


ccagctgccc 


tacgtcctcg 


gttcggctca 


ccagggctgc 


1920 




cttccgccgt 


tcccagcaga 


cgtcttcatg 


attcctcagt 


acggctactt 


gactctgaac 


1980 


10 


aatggcagcc 


aagcggtagg 


acgttcttca 


ttctactgtc 


tagagtattt 


tccctctcag 


2040 


atgctgagga 


cgggaaacaa 


cttcaccttc 


agctacactt 


ttgaagacgt 


gcctttccac 


2100 




agcagctacg 


cgcacagcca 


gagtctggat 


cggctgatga 


atcctctcat 


tgaccagtac 


2160 


15 


ctgtattacc 


tgagcaaaac 


tcagggtaca 


agtggaacaa 


cgcagcaatc 


gagactgcag 


2220 


ttcaaccaag 


ctgggcctag 


ctccatggct 


cagcaggcca 


aaaactggct 


accgggaccc 


2280 




agctaccgac 


agcagcgaat 


gtctaagacg 


gctaatgaca 


acaacaacag 


tgaatttgct 


2340 


20 


tggactgcag 


ccaccaaata 


ttacccgaat 


ggaagaaatt 


ctctggtcaa 


tcccgggccc 


2400 


ccaatggcca 


gtcacaagga 


cgatgaggaa 


aagtatttcc 


ccatgcacgg 


aaatctcatc 


2460 




tttggaaaac 


aaggcacagg 


aactaccaat 


gtggacattg 


aatcagt get 


tattacagac 


2520 


25 


gaagaagaaa 


tcagaacgac 


taatcctgtg 


gctacagaac 


aatacggaca 


ggttgccacc 


2580 


aaccgtcaga 


gtcagaacac 


cacagcttcc 


tatggaagtg 


tggacagcca 


gggaatctta 


2640 




cctggaatgg 


tgtggcagga 


ccgcgatgtc 


tatcttcaag 


gtcccatttg 


ggccaaaact 


2700 


30 


cctcacacgg 


acggacactt 


tcatccttct 


ccgctcatgg 


gaggctttgg 


actgaaacac 


2760 


cctcctcccc 


agatcctgat 


caaaaacaca 


cctgtgccag 


cgaatcccgc 


gaccactttc 


2820 




actcctggaa 


agtttgcttc 


gttcattacc 


cagtattcca 


ccggacaggt 


cagcgtggaa 


2880 


35 


atagagtggg 


agctgcagaa 


agaaaacagc 


aaacgctgga 


acccggaaat 


tcagtacacc 


2940 


tccaactaca 


acaagtcggt 


gaatgtggag 


tttaccgtgg 


acgcaaacgg 


tgtttattct 


30C0 




gaaccccgcc 


ctattggcac 


tcgttacctt 


acccggaact 


tgtaatttcc 


tgttaatgaa 


3060 


40 


taaaccgatt 


tatgcgtttc 


agttgaactt 


tggtctctgc 


gaagggcgaa 


ttc 


3113 


45 


<210> 56 
<211> 3122 
<212> DNA 

<213> new AAV serotype, clone A3 . 7 








<400> 56 
agcggccgcg 


aatt cgccct 


ttctacggct 


gcgtcaactg 


gaccaatgaa 


aactttccct 


60 




tcaacgattg 


cgtcgacaag 


atggtcatct 


ggtgggagga 


gggaaagatg 


accgccaagg 


120 


50 


tcgtggaatc 


tgccaaagcc 


attctgggtg 


gaagcaaggt 


tcgtgtggac 


cagaaatgca 


180 




ggtcttcggc 


ccagatcgac 


ccgactccgg 


tgattgtcac 


ctctaacacc 


aacatgtgcg 


240 




ccgtgattga 


cggaaactcg 


accaccttcg 


agcaccagca 


gccgttgcaa 


gaccggatgt 


300 


55 


tcaaatttga 


acttacccgc 


cgtttggatc 


atgactttgg 


gaaggtcacc 


aagcaggaag 


360 
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tcaaagactt 


tttccggtgg 


gctcaagatc 


acgtgactga 


ggtggagcat 


gagttctacg 


420 


tcaaaaaggg 


tggagccaag 


aaaaggcccg 


cccccgatga 


tgtatatata 


aatgagccca 


480 


agcgggcgcg 


cgagtcagtt 


gcgcagccat 


cgacgtcaga 


cgcggaagct 


tcgataaact 


540 


acgcggacag 


gtaccaaaac 


aaatgttctc 


gtcacgtggg 


catgaatctg 


atgctgtttc 


600 


cctgtcgaca 


atgcgaaaga 


atgaatcaga 


attcaaatat 


ctgcttcaca 


cacgggcaaa 


660 


aagactgttt 


ggaatgcttt 


cccgtgtcag 


aatctcaacc 


cgtttctgtc 


gtcagaaaaa 


720 


cgtatcagaa 


actttgttac 


attcatcata 


tcatgggaaa 


agtaccagac 


gcctgcactg 


780 


cctgcgacct 


ggtaaatgtg 


gacttggatg 


actgtatttc 


tgagcaataa 


atgacttaaa 


840 


tcaggtatgg 


ctgctgacgg 


ttatcttcca 


gattggctcg 


aggacactct 


ctctgaagga 


900 


atcagacagt 


ggtggaagct 


caaacctggc 


ccaccaccgc 


cgaaacctaa 


ccaacaacac 


960 


cgggacgaca 


gtaggggtct 


tgtgcttcct 


gggtacaagt 


acctcggacc 


cttcaacgga 


1020 


ctcgacaaag 


gagagccggt 


caacgaggca 


gacgccgcgg 


ccctcgagca 


cgacaaagcc 


1060 


tacgaccacc 


agctcaagca 


aggggacaac 


ccgtacctca 


aatacaacca 


cgcggacgct 


1140 


gaatttcagg 


agcgtcttca 


agaagatacg 


tctttcgggg 


gcaacctcgg 


gcgagcagtc 


1200 


ttccaggcca 


aaaagagggt 


actcgagcct 


cttggtctgg 


ttgaggaagc 


tgttaagacg 


1260 


gctcctggaa 


aaaagagacc 


tatagagcag 


tctcctgcag 


aaccggactc 


ttcctcgggc 


1320 


atcggcaaat 


caggccagca 


gcccgctaag 


aaaagactca 


attttggtca 


gactggcgac 


1380 


acagagtcag 


tcccagaccc 


tcaaccaatc 


ggagaacccc 


ccgcagcccc 


ctctggtgtg 


1440 


ggatctaata 


caatggcttc 


aggcggtggg 


gcaccaatgg 


cagacaataa 


cgaaggcgcc 


1500 


gacggagtgg 


gtaattcctc 


gggaaattgg 


cattgcgatt 


ccacatggat 


gggcgacaga 


1560 


gttatcacca 


ccagcacaag 


aacctgggcc 


ctccccacct 


acaataatcg 


cctctacaag 


1620 


caaatctcca 


gcgaatcggg 


agccaccaac 


gacaaccact 


acttcggcta 


cagcaccccc 


1680 


tgggggtatt 


ttgactttaa 


cagattccac 


tgtcacttct 


caccacgtga 


ctggcagcga 


1740 


ctcatcaaca 


acaactgggg 


atttagaccc 


aagaaactca 


atttcaagct 


cttcaacatc 


1800 


caagtcaagg 


aggtcacgca 


gaatgatgga 


accacgacca 


tcgccaataa 


ccttaccagc 


1860 


acggtgcagg 


tcttcacaga 


ctctgagtac 


cagctgccct 


acgtcctcgg 


ttcggctcac 


1920 


cagggctgcc 


ttccgccgtt 


cccagcagac 


gtcttcatga 


ttcctcagta 


cggctacttg 


1980 


actctgaaca 


atggcagcca 


agcggtagga 


cgttct teat 


tctactgtc t 


agagtatttt 


2040 


ccctctcaga 


tgctgaggac 


gggaaacaac 


ttcaccttca 


gctacacttt 


tgaagacgtg 


2100 


cctttccaca 


gcagctacgc 


gcacagccag 


agtctggatc 


ggctgatgaa 


tcctctcatt 


2160 


gaccagtacc 


tgtattacct 


gagcaaaact 


cagggtacaa 


gtggaacaac 


gcagcaatcg 


2220 


agactgcagt 


tcagccaagc 


tgggcctagc 


tccatggctc 


agcaggccaa 


aaactggcta 


2280 
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ccgggaccca 


gctaccgaca 


gcagcgaatg 


tctaagacgg 


ctaatgacaa 


caacaacagt 


2340 


gaatttgctt 


ggactgcagc 


caccaaatat 


tacctgaatg 


gaagaaattc 


tctggtcaat 


2400 


cccgggcccc 


caatggccag 


tcacaaggac 


gatgaggaaa 


agtatttccc 


catgcacgga 


2460 


aatctcatct 


ttggaaaaca 


aggcacagga 


actaccaatg 


tggacattga 


atcagtgctt 


2520 


attacagacg 


aagaagaaat 


cagaacaact 


aatcctgtgg 


ctacagaaca 


atacggacag 


2580 


gttgccacca 


accatcagag 


tcagaacacc 


acagcttcct 


atggaagtgt 


ggacagccag 


2640 


ggaat cttac 


ctggaatggt 


gtggcaggac 


cgcgatgtct 


atcttcaagg 


tcccatttgg 


27C0 


gccaaaactc 


ctcacacgga 


cggacacttt 


catccttctc 


cgctcatggg 


aggctttgga 


2760 


ctgaaacacc 


ctcctcccca 


gatcctgatc 


aaaaacacac 


ctgtgccagc 


gaatcccgcg 


2820 


accactttca 


ctcctggaaa 


gtttgcttcg 


ttcattaccc 


agtattccac 


cggacaggtc 


2880 


ctyt^y uyycicict 


L-ctyciy uyyyci 


L.yL^cty cicici 
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2 94 0 


cagtacacct 


ccaactacaa 


caagtcggtg 


aatgtggagt 


ttaccgtgga 


cgcaaacggt 


3000 


gtttattctg 


aaccccgccc 


t attggcac t 


cgttacctta 


cccggaactt 


gt aatttcct 


3060 


gttaatgaat 


aaaccgattt 


atgcgtttca 


gttgaacttt 


ggtctctgcg 


aagggcgaat 


3120 


tc 












3122 



<210> 57 

<211> 3123 

30 <212> DNA 

<213> new AAV serotype, clone A3 . 3 

<400> 57 





gaattcgccc 


tttctacggc 


tgcgtcaact 


ggaccaatga 


aaactttccc 


ttcaacgatt 


60 


35 


gcgtcgacaa 


gatggtgatc 


tggtgggagg 


agggaaagat 


gaccgccaag 


gtcgtggaat 


120 




ctgccaaagc 


cattctgggt 


ggaggcaagg 


ttcgtgtgga 


ccagaaatgc 


aagtcttcgg 


180 




cccagatcga 


cccgactccg 


gtgattgtca 


cctctaacac 


caacatgtgc 


gccgtgattg 


240 


40 


acggaaactc 


gaccaccttc 


gagcaccagc 


agccgttgca 


agaccggatg 


ttcaaatttg 


300 




aacttacccg 


ccgtttggat 


catgactttg 


ggaaggtcac 


caagcaggaa 


gtcaaagact 


360 




ttttccggtg 


ggctcaagat 


cacgtgactg 


aggtggagca 


tgagttctac 


gtcaaaaagg 


420 


45 


gtggagccaa 


gaaaaggccc 


gcccccgatg 


atgtatatat 


aaatgagccc 


aagcgggcgc 


480 




gcgagtcagt 


tgcgcagcca 


tcgacgtcag 


acgcggaagc 


ttcgataaac 


tacgcggaca 


540 




ggtaccaaaa 


caaatgttct 


cgtcacgtgg 


gcatgaatct 


gatgctgttt 


ccctgtcgac 


600 


50 


aatgcgaaag 


aatgaatcag 


aattcaaata 


tctgcttcac 


acacgggcaa 


aaagactgtt 


660 




tggaatgctt 


tcccgtgtca 


gaatctcaac 


ccgtttctgt 


cgtcagaaaa 


acgtatcaga 


720 




aactttgtta 


cattcatcat 


atcatgggaa 


aagtaccaga 


cgcctgcact 


gcctgcgacc 


780 


55 


tggtaaatgt 


ggacttggat 


gactgtattt 


ctgagcaata 


aatgacttaa 


atcaggtatg 


840 
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gctgctgacg 


gttatcttcc 


agattggctc 


gaggacactc 


tctctgaagg 


aatcagacag 


900 


tggtggaagc 


tcaaacctgg 


cccaccaccg 


ccgaaaccta 


accaacaaca 


ccgggacgac 


960 


agtaggggtc 


ttgtgcttcc 


tgggtacaag 


tacctcggac 


ccttcaacgg 


actcgacaaa 


1020 


ggagagccgg 


tcaacgaggc 


agacgccgcg 


gccctcgagc 


acgacaaagc 


ctacgaccac 


1080 


cagctcaagc 


aaggggacaa 


cccgtacctc 


aaatacaacc 


acgcggacgc 


tgaatttcag 


1140 


gagcgtcttc 


aagaagatac 


gtctttcggg 


ggcaacctcg 


ggcgagcagt 


cttccaggcc 


1200 


aaaaagaggg 


tactcgagcc 


tcttggtctg 


gttgaggaag 


ctgttaagac 


ggctcctgga 


1260 


aaaaagagac 


ctatagagca 


gtctcctgca 


gaaccggact 


cttcctcggg 


catcggcaaa 


1320 


tcaggccagc 


agcccgctaa 


gaaaagactc 


aattttggtc 


agactggcga 


cacagagtca 


1380 


gtcccaggcc 


ctcaaccaat 


cggagaaccc 


cccgcagccc 


cctctggtgt 


gggatc taat 


1440 


acaatggctt 


caggcggtgg 


ggcaccaatg 


gcagacaata 


acgaaggcgc 


cgacggagtg 


1500 


ggtaattcct 


cgggaaattg 


gcattgcgat 


tccacatgga 


tgggcgacag 


agttatcacc 


1560 


accagcacaa 


gaacctgggc 


cctccccacc 


tacaataatc 


acctctacaa 


gcaaatctcc 


1620 


agcgaatcgg 


gagccaccaa 


cgacaaccac 


tacttcggct 


acagcacccc 


ctgggggtat 


1680 


tttgacttta 


acagattcca 


ctgtcacttc 


tcaccacgtg 


actggcagcg 


actcatcaac 


1740 


aacaactggg 


gatttagacc 


caagaaactc 


aatttcaagc 


tcttcaacat 


ccaagtcaag 


1800 


gaggtcacgc 


agaatgatgg 


aaccacgacc 


atcgccaata 


accttaccag 


cgcggtgcag 


1860 


gtcttcacag 


actctgagta 


ccagctgccc 


tacgtcctcg 


gttcggctca 


ccagggctgc 


1920 


cttccgccgt 


tcccagcaga 


cgtcttcatg 


attcctcagt 


acggctactt 


gactctgaac 


1980 


aatggcagcc 


aagcggtagg 


acgttcttca 


ttctactgtc 


tagagtattt 


tccctctcag 


2040 


atgctgagga 


cgggaaacaa 


cttcaccttc 


agctacactt 


ttgaagacgt 


gcctttccac 


2100 


agcagctacg 


cgcacagcca 


gagtctggat 


cggctgatga 


atcctctcat 


tgaccagtac 


2160 


ctgtattacc 


tgagcaaaac 


tcagggtaca 


agtggaacaa 


cgcagcaatc 


gagactgcag 


2220 


ttcagccaag 


ctgggcctag 


ctccatggct 


cagcaggcca 


aaaactggct 


accgggaccc 


2280 


agctaccgac 


agcagcgaat 


gtctaagacg 


gctaatgaca 


acaacaacag 


tgaatttgct 


2340 


tggactgcag 


ccaccaaata 


ttacctgaat 


ggaagaaatt 


ctctggtcaa 


tcccgggccc 


2400 


ccagtggcca 


gtcacaagga 


cgatgaggaa 


aagtatttcc 


ccatgcacgg 


aaatctcatc 


2460 


Lr i» ^ y y Q. C4, ch. 


MCI ^ ^ ^a(»pCl «J 


Cl Cl l^aOk^MCl L- 








2 52 0 


gaagaagaaa 


tcagaacaac 


taatcctgtg 


gctacagaac 


aatacggaca 


ggttgccacc 


2580 


aaccatcaga 


gtcagaacac 


cacagcttcc 


tatggaagtg 


tggacagcca 


gggaatctta 


2640 


cctggaatgg 


tgtggcagga 


ccgcgatgtc 


tatcttcaag 


gtcccatttg 


ggccaaaact 


2700 


cctcacacgg 


acggacactt 


tcatccttct 


ccgctcatgg 


gaggctttgg 


actgaaacac 


2760 
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cctcctcccc agatcctgat caaaaacaca cctgtgccag cgaatcccgc gaccactttc 2820 

actcctggaa agtttgcttc gttcattacc cagtattcca cctgacaggt cagcgtggaa 2880 

atagagtggg agctgcagaa agaaaacagc aaacgctgga acccagaaat tcagtacacc 2 940 

tccaactaca acaagtcggt gaatgtggag tttaccgtgg acgcaaacgg tgtttattct 3000 

gaaccccgcc ctattggcac tcgttacctt acccggaact tgtaatttcc tgttaatgaa 3060 

taagccgatt tatgcgtttc agttgaactt tggtctctgc gaagggcgaa ttcgtttaaa 3120 

cct 3123 

i5 <210> 58 

<211> 2969 
<212> DNA 

<213> new AAV serotype, clone 42.12 
<400> 58 



20 


gaattcgccc 


tttctacggc 


tgcgtcaact 


ggaccaatga 


gaactttccc 


ttcaacgatt 


60 




gcgtcgacaa 


gatggtgatc 


tggtgggagg 


agggcaagat 


gacggccaag 


gtcgtggagt 


120 




ccgccaaggc 


cattctcggc 


ggcagcaagg 


tgcgcgtgga 


ccaaaagtgc 


aagtcgtccg 


180 


25 


cccagatcga 


ccccaccccc 


gtgatcgtca 


cctccaacac 


caacatgtgc 


gccgtgattg 


240 




acgggaacag 


caccaccttc 


gagcaccagc 


agccgttaca 


agaccggatg 


ttcaaatttg 


300 




aactcacccg 


ccgtctggag 


cacgactttg 


gcaaggtgac 


aaagcaggaa 


gtcaaagagt 


360 


30 


tcttccgctg 


ggcgcaggat 


cacgtgaccg 


aggtggcgca 


tgagttctac 


gtcagaaagg 


420 




gtggagccaa 


caagagaccc 


gcccccgatg 


acgcggataa 


aagcgagccc 


aagcgggcct 


480 




gcccctcagt 


cgcggatcca 


tcgacgtcag 


acgcggaagg 


agctccggtg 


gactttgccg 


540 


35 


acaggtacca 


aaacaaatgt 


tctcgtcacg 


cgggcatgct 


tcagatgctg 


tttccctgca 


600 




agacatgcga 


gagaatgaat 


cagaatttca 


acatttgctt 


cacgcacggg 


accagagact 


660 




gttcagaatg 


tttccccggc 


gtgtcagaat 


ctcaaccggt 


cgtcagaaag 


aggacgtatc 


720 


40 


ggaaactctg 


tgccattcat 


catctgctgg 


ggcgggctcc 


cgagattgct 


tgctcggcct 


780 




gcgatctggt 


caacgtggac 


ctggatgact 


gtgtttctga 


gcaataaatg 


acttaaacca 


840 




ggtatggctg 


ccgatggtta 


tcttccagat 


tggctcgagg 


acaacctctc 


tgagggcatc 


900 


45 


cgcgagtggt 


gggacttgaa 


acctggagcc 


ccgaaaccca 


aagccaacca 


gcaaaagcag 


960 




gacgacggcc 


ggggtctggt 


get tcctggc 


tacaagtacc 


tcggaccctt 


caacggactc 


1020 




gacaagggag 


agccggtcaa 


cgaggcagac 


gccgcggccc 


tcgagcacga 


caaggcctac 


1080 


50 


gacaagcagc 


tcgagcaggg 


ggacaacccg 


tacctcaagt 


acaaccacgc 


cgacgccgag 


1140 




tttcaggagc 


gtcttcaaga 


agatacgtct 


tttgggggca 


acctcgggcg 


agcagtcttc 


1200 




caggccaaga 


agcgggttct 


cgaacctctc 


ggtctggttg 


aggaaggcgc 


taagacggct 


1260 


55 


cctggaaaga 


agagaccggt 


agagccatca 


ccccagcgtt 


ctccagactc 


ctctacgggc 


1320 
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<210> 59 
<211> 3129 
<212> DNA 

<213> new AAV serotype, clone 44.2 
<400> 59 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 60 
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cacgtgactg 
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10 



15 



20 



25 



30 
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gacgtgtacc 
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2700 
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acacggacgg 
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27 60 
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2 8 2 0 
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acagcaccgg 


2880 
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gtggaaattg 
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2940 
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atctacaaat 
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3000 
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3120 
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3129 



35 



<210> 60 
<211> 733 
<212> PRT 

40 <213> capsid protein of AAV serotype, clone ClVPl 

<400> 60 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 

45 

Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

50 Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 

35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 

55 
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Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Leu Glu Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly Lys 
145 150 155 160 



Lys Gly Lys Gin Pro Ala Lys Lys Arg Leu Asn Phe Glu Glu Asp Thr 
165 170 175 



Gly Ala Gly Asp Gly Pro Pro Glu Gly Ser Asp Thr Ser Ala Met Ser 
180 185 190 



Ser Asp lie Glu Met Arg Ala Ala Pro Gly Gly Asn Ala Val Asp Ala 
195 200 205 



Gly Gin Gly Ser Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys 
210 215 220 



Asp Ser Thr Trp Ser Glu Gly Lys Val Thr Thr Thr Ser Thr Arg Thr 
225 230 235 240 



Trp Val Leu Pro Thr Tyr Asn Asn His Leu Tyr Leu Arg Leu Gly Thr 
245 250 255 



Thr Ser Asn Ser Asn Thr Tyr Asn Gly Phe Ser Thr Pro Trp Gly Tyr 
260 265 270 



Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin 
275 280 285 



Arg Leu lie Asn Asn Asn Trp Gly Leu Arg Pro Lys Ala Met Arg Val 
290 295 300 



Lys lie Phe Asn lie Gin Val Lys Glu Val Thr Thr Ser Asn Gly Glu 
305 310 315 320 



147 



EP1 310 571 A2 



Thr Thr Val Ala Asn Asn Leu Thr Ser Thr Val Gin Tie Phe Ala Asp 
325 330 335 



Ser Ser Tyr Glu Leu Pro Tyr Val Met Asp Ala Gly Gin Glu Gly Ser 
340 345 350 



Leu Ser Pro Phe Pro Asn Asp Val Phe Met Val Pro Gin Tyr Gly Tyr 
355 360 365 



Cys Gly lie Val Thr Gly Glu Asn Gin Asn Gin Thr Asp Arg Asn Ala 
370 375 380 



Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg Thr Gly Asn 
385 390 395 400 



Asn Phe Glu Met Ala Tyr Asn Phe Gly Lys Val Pro Phe His Ser Met 
405 410 415 



Tyr Ala Tyr Ser Gin Ssr Pro Asp Arg Leu Met Asn Pro Leu Leu Asp 
420 425 430 



Gin Tyr Leu Trp His Leu Gin ser Thr Thr Ser Gly Glu Thr Leu Asn 
435 440 445 



Gin Gly Asn Ala Ala Thr Thr Phe Gly Lys lie Arg Ser Gly Asp Phe 
450 455 460 



Ala Phe Tyr Arg Lys Asn Trp Leu Pro Gly Pro Cys Val Lys Gin Gin 
465 470 475 480 



Arg Leu Ser Lys Thr Ala Ser Gin Asn Tyr Lys lie Pro Ala Ser Gly 
485 490 495 



Gly Asn Ala Leu Leu Lys Tyr Asp Thr His Tyr Thr Leu Asn Asn Arg 
500 505 510 



Trp Ser Asn lie Ala Pro Gly Pro Pro Met Ala Thr Ala Gly Pro Ser 
515 520 525 



Asp Gly Asp Phe Ser Asn Ala Gin Leu lie Phe Pro Gly Pro Ser Val 
530 535 540 



Thr Gly Asn Thr Thr Thr Ser Ala Asn Asn Leu Leu Phe Thr Ser Glu 
545 550 555 560 



Glu Glu He Ala Ala Thr Asn Pro Arg Asp Thr Asp Met Phe Gly Gin 
565 570 575 
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He Ala Asp Asn Asn Gin Asn Ala Thr Thx Ala Pro He Thr Gly Asn 
580 585 590 



Val Thr Ala Met Gly Val Leu Pro Gly Met Val Trp Gin Asn Arg Asp 
595 600 605 



He Tyr Tyr Gin Gly Pro He Trp Ala Lys He Pro His Ala Asp Gly 
610 615 620 



His Phe His Pro Ser Pro Leu He Gly Gly Phe Gly Leu Lys His Pro 
625 630 635 640 



Pro Pro Gin He Phe He Lys Asn Thr Pro Val Pro Ala Asn Pro Ala 
645 650 655 



Thr Thr Phe Thr Ala Ala Arg Val Asp Ser Phe He Thr Gin Tyr Ser 
660 665 670 



Thr Gly Gin Val Ala Val Gin He Glu Trp Glu He Glu Lys Glu Arg 
675 680 685 



Ser Lys Arg Trp Asn Pro Glu Val Gin Phe Thr Ser Asn Tyr Gly Asn 
690 695 700 



Gin Ser Ser Met Leu Trp Ala Pro Asp Thr Thr Gly Lys Tyr Thr Glu 
705 710 715 720 



Pro Arg Val He Gly Ser Arg Tyr Leu Thr Asn His Leu 
725 730 



<210> 61 

<211> 733 
<212> PRT 

<213> capsid protein of AAV serotype, clone C2VP1 

<400> 61 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Leu 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe His Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 
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Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Bhe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Leu Glu Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly Lys 
145 150 155 160 



Lys Gly Lys Gin Pro Ala Lys Lys Arg Leu Asn Phe Glu Glu Asp Thr 
165 170 175 



Gly Ala Gly Asp Gly Pro Pro Glu Gly Ser Asp Thr Ser Ala Met Ser 
180 185 190 



Ser Asp lie Glu Met Arg Ala Ala Pro Gly Gly Asn Ala Val Asp Ala 
195 200 205 



Gly Gin Gly Ser Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys 
210 215 220 



Asp Ser Thr Trp Ser Glu Gly Lys Val Thr Thr Thr Ser Thr Arg Thr 
225 230 235 " 240 



Trp Val Leu Pro Thr Tyr Asn Asn His Leu Tyr Leu Arg Leu Gly Thr 
245 250 255 



Thr Ser Asn Ser Asn Thr Tyr Asn Gly Phe Ser Thr Pro Trp Gly Tyr 
260 265 270 



Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin 
275 280 285 



Arg Leu lie Asn Asn Asn Trp Gly Leu Arg Pro Lys Ala Met Arg Val 
290 295 300 



Lys lie Phe Asn He Gin Val Lys Glu Val Thr Thr Ser A.sn Gly Glu 
305 310 315 320 



Thr Thr Val Ala Asn Asn Leu Thr Ser Thr Val Gin He Phe Ala Asp 
325 330 335 
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Ser Ser Tyr Glu Leu Pro Tyr Val Met Asp Ala Gly Gin Giu Gly Ser 
340 345 350 



Leu Pro Pro Phe Pro Asn Asp Val Phe Met Val Pro Gin Tyr Gly Tyr 
355 360 365 



Cys Gly lie Val Thr Gly Glu Asn Gin Asn Gin Thr Asp Arg Asn Ala 
370 375 380 



Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg Thr Gly Asn 
385 390 395 400 



Asn Phe Glu Met Ala Tyr Asn Phe Glu Lys Val Pro Phe His Ser Met 
405 410 415 



Tyr Ala His Ser Gin Ser Leu Asp Arg Lou Met Asn Pro Leu Leu Asp 
420 425 430 



Gin Tyr Leu Trp His Leu Gin Ser Thr Thr Ser Gly Glu Thr Leu Asn 
435 440 445 



Gin Gly Asn Ala Ala Thr Thr Phe Gly Lys lie Arg Ser Gly Asp Phe 
450 455 460 



Ala Phe Tyr Aig Lys Asn Trp Leu Pro Gly Pro Cys Val Lys Gin Gin 
465 470 475 480 



Arg Phe Ser Lys Thr Ala Ser Gin Asn Tyr Lys lie Pro Ala Ser Gly 
485 490 495 



Gly Asn Ala Leu Leu Lys Tyr Asp Thr His Tyr Thr Leu Asn Asn Arg 
500 505 510 



Trp Ser Asn lie Ala Pro Gly Pro Pro Met Ala Thr Ala Gly Pro Ser 
515 520 525 



Asp Gly Asp Phe Ser Asn Ala Gin Leu lie Phe Pro Gly Pro Ser Val 
530 535 540 



Thr Gly Asn Thr Thr Thr Ser Ala Asn Asn Leu Leu Phe Thr Ser Glu 
545 550 555 560 



Gly Glu lie Ala Ala Thr Asn Pro Arg Asp Thr Asp Met Phe Gly Gin 
565 570 575 



lie Ala Asp Asn Asn Gin Asn Ala Thr Thr Ala Pro lie Thr Gly Asn 
580 585 590 
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Val Thr Ala Met Gly Val Leu Pro Gly Met Val Trp Gin Asn Arg Asp 
595 600 605 



He Tyr Tyr Gin Gly Pro He Trp Ala Lys He Pro His Ala Asp Gly 
610 €15 620 



His Phe His Pro Ser Pro Leu He Gly Gly Phe Gly Leu Lys His Pro 
625 630 635 640 



Pro Pro Gin He Phe He Lys Asn Thr Pro Val Pro Ala Asn Pro Ala 
645 650 655 



Thr Thr Phe Thr Ala Ala Arg Val Asp Ser Phe He Thr Gin Tyr Ser 
660 665 670 



Thr Gly Gin Val Ala Val Gin He Glu Trp Glu He Glu Lys Glu Arg 
675 680 685 



Ser Lys Arg Arg Asn Pro Glu Val Gin Phe Thr Ser Asn Tyr Gly Asn 
690 €95 700 



Gin Ser Ser Met Leu Trp Ala Pro Asp Thr Thr Gly Lys Tyr Thr Glu 
705 710 715 720 



Pro Arg Val He Gly Ser Arg Tyr Leu Thr Asn His Leu 
725 730 



<210> 62 
<211> 733 
<212> PRT 

<213> capsid protein of AAV serotype, clone C5VP1@2 

<400> 62 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Glu Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 
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Asp Ala Glu Phe Gin Glu Arg Leu Gin Slu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Leu Glu Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly Lys 
145 150 155 160 



Lys Gly Lys Gin Pro Ala Lys Lys Arg Leu Asn Phe Glu Glu Asp Thr 
165 170 175 



Gly Ala Gly Asp Gly Pro Pro Glu Gly Ser Asp Thr Ser Ala Met Ser 
180 185 190 



Ser Asp rie Glu Met Arg Ala Ala Pro Gly Gly Asn Ala Val Asp Ala 
195 200 205 



Gly Gin Gly Ser Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys 
210 215 220 



Asp Ser Thr Trp Ser Glu Gly Lys Val Thr Thr Thr Ser Thr Arg Thr 
225 230 235 240 



Trp Val Leu Pro Thr Tyr Asn Asn His Leu Tyr Leu Arg Leu Gly Thr 
245 250 255 



Thr Ser Asn Ser Asn Thr Tyr Asn Gly Phe Ser Thr Pro Trp Gly Tyr 
260 265 270 



Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin 
275 280 285 



Arg Leu lie Asn Asn Asn Trp Gly Leu Arg Pro Lys Ala Met Arg Val 
290 295 300 



Lys lie Phe Asn He Gin Val Lys Glu Val Thr Thr Ser Asn Gly Glu 
305 310 315 320 



Thr Thr Val Ala Asn Asn Leu Thr Ser Thr Val Gin lie Phe Ala Asp 
325 330 335 



Ser Ser Tyr Glu Leu Pro Tyr Val Met Asp Ala Gly Gin Glu Gly Ser 

340 345 350 
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Leu Pro Pro Phe Pro Asn Asp Val Phe Met Val Pro Gin Tyr Gly Tyr 
355 360 365 



Cys Gly lie Val Thr Gly Glu Asn Gin Asn Gin Thr Asp Arg Asn Ala 
370 375 380 



Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg Thr Gly Asn 
385 390 395 400 



Asn Phe Glu Thr Ala Tyr Asn Phe Glu Lys Val Pro Phe His Ser Met 
405 410 415 



Tyr Ala His Ser Gin ser Leu Asp Gly Leu Met Asn Pro Leu Leu Asp 
420 425 430 



Gin Tyr Leu Trp His Leu Gin Ser Thr Thr Ser Gly Glu Thr Leu Asn 
435 440 445 



Gin Gly Asn Ala Ala Thr Thr Phe Gly Lys lie Arg Ser Gly Asp Phe 
450 455 460 



Ala Phe Tyr Arg Lys Asn Trp Leu Pro Gly Pro Cys Val Lys Gin Gin 
465 470 475 480 



Arg Phe Ser Lys Thr Ala Ser Gin Asn Tyr Lys lie Pro Ala Ser Gly 
485 490 495 



Gly Asn Ala Leu Leu Lys Tyr Asp Thr His Tyr Thr Leu Asn Asn Arg 
500 505 510 



Trp Ser Asn lie Ala Pro Gly Pro Pro Met Ala Thr Ala Gly Pro Ser 
515 520 525 



Asp Gly Asp Phe Ser Asn Ala Gin Leu He Phe Pro Gly Pro Ser Val 
530 535 540 



Thr Gly Asn Thr Thr Thr Ser Ala Asn Asn Leu Leu Phe Thr Ser Glu 
545 550 555 560 



Glu Glu He Ala Ala Thr Asn Pro Arg Asp Thr Asp Met Phe Gly Gin 
565 570 575 



He Ala Asp Asn Asn Gin Asn Ala Thr Thr Ala Pro He Thr Gly Asn 
580 585 590 



Val Thr Ala Met Gly Val Leu Pro Gly Met Val Trp Gin Asn Arg Asp 
595 600 605 
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He Tyr Tyr Gin Gly Pro He Trp Ala Lys He Pro His Ala Asp Gly 
610 615 620 



His Phe His Pro Ser Pro Leu He Gly Gly Phe Gly Leu Lys His Pro 
625 630 635 640 



Pro Pro Gin He Phe He Lys Asn Thr Pro Val Pro Ala Tyr Pro Ala 
645 650 655 



Thr Thr Phe Thr Ala Ala Arg Val Asp Ser Phe He Thr Gin Tyr Ser 
660 665 670 



Thr Gly Gin Val Ala Val Gin He Glu Trp Glu He Glu Lys Glu Arg 
675 680 685 



Ser Lys Arg Trp Asn Pro Glu Val Gin Phe Thr Ser Asn Cys Gly Asn 
690 695 700 



Gin Ser Ser Met Leu Trp Ala Pro Asp Thr Thr Gly Lys Tyr Thr Glu 
705 710 715 720 



Pro Arg Val He Gly Ser Arg Tyr Leu Thr Asn His Leu 
725 730 



<210> 63 
<211> 734 
<212> PRT 

<213> capsid protein of AAV serotype^ clone AAV4VP1 
<400> 63 

Met Thr Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser Glu 
15 10 15 



Gly Val Arg Glu Trp Trp Ala Leu Gin Pro Gly Ala Pro Lys Pro Lys 
20 25 30 



Ala Asn Gin Gin His Gin Asp Asn Ala Arg Gly Leu Val Leu Pro Gly 
35 40 45 



Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro Val 
50 55 60 



Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp Gin 
65 70 75 80 



Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala Asp 
85 90 95 



Ala Glu Phe Gin Gin Arg Leu Gin Gly Asp Thr Ser Phe Gly Gly Asn 
100 105 110 
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Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro Leu 
115 120 125 



Gly Leu Val Glu Gin Ala Gly Glu Thr Ala Pro Gly Lys Lys Arg Pro 
130 135 140 



Leu lie Glu Ser Pro Gin Gin Pro Asp Ser Ser Thr Gly He Gly Lys 
145 150 155 160 



Lys Gly Lys Gin Pro Ala Lys Lys Lys Leu Val Phe Glu Asp Glu Thr 
165 170 175 



Gly Ala Gly Asp Gly Pro Pro Glu Gly Ser Ihr Ser Gly Ala Met Ser 
180 185 190 



Asp Asp Ser Glu Met Arg Ala Ala Ala Gly Gly Ala Ala Val Glu Gly 
195 200 205 



Gly Gin Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys 
210 215 220 



Asp Ser Thr Trp Ser Glu Gly His Val Thr Thr Thr Ser Thr Arg Thr 
225 230 235 240 



Trp Val Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Arg Leu Gly Glu 
245 250 255 



Ser Leu Gin Ser Asn Thr Tyr Asn Gly Phe Ser Thr Pro Trp Gly Tyr 
260 265 270 



Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin 
275 280 285 



Arg Leu He Asn Asn Asn Trp Gly Met Arg Pro Lys Ala Met Arg Val 
290 295 300 



Lys He Phe Asn He Gin Val Lys Glu Val Thr Thr Ser Asn Gly Glu 
305 310 315 320 



Thr Thr Val Ala Asn Asn Leu Thr Ser Thr Val Gin He Phe Ala Asp 
325 330 335 



Ser Ser Tyr Glu Leu Pro Tyr Val Met Asp Ala Gly Gin Glu Gly Ser 
340 345 350 



Leu Pro Pro Phe Pro Asn Asp Val Phe Met Val Pro Gin Tyr Gly Tyr 
355 360 365 
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Cys Gly leu Val Thr Gly Asn Thr Ser Gin Gin Gin Thr Asp Arg Asn 
370 375 380 



Ala Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg Thr Gly 
385 390 395 400 



Asn Asn Phe Glu lie Thr Tyr Ser Phe Giu Lys Val Pro Phe His Ser 
405 410 415 



Met Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro Leu lie 
420 425 430 



Asp Gin Tyr Leu Trp Gly Leu Gin Ser Thr Thr Thr Gly Thr Thr Leu 
435 440 445 



Asn Ala Gly Thr Ala Thr Thr Asn Phe Thr Lys Leu Arg Pro Thr Asn 
450 455 460 



Phe Ser Asn Phe Lys Lys Asn Trp Leu Pro Gly Pro Ser lie Lys Gin 
465 470 475 480 



Gin Gly Phe Ser Lys Thr Ala Asn Gin Asn Tyr Lys lie Pro Ala Thr 
485 490 ' 495 



Gly Ser Asp Ser Leu lie Lys Tyr Glu Thr Hus Ser Thr Leu Asp Gly 
500 505 510 



Arg Trp Ser Ala Leu Thr Pro Gly Pro Pro Met Ala Thr Ala Gly Pro 
515 520 525 



Ala Asp Ser Lys Phe Ser Asn Ser Gin Leu lie Phe Ala Gly Pro Lys 
530 535 540 



Gin Asn Gly Asn Thr Ala Thr Val Pro Gly Thr Leu lie Phe Thr Ser 
545 550 555 560 



Glu Glu Glu Leu Ala Ala Thr Asn Ala Thr Asp Thr Asp Met Trp Gly 
565 570 575 



Asn Leu Pro Gly Gly Asp Gin ser Asn Ser Asn Leu Pro Thr Val Asp 
580 585 590 



Arg Leu Thr Ala Leu Gly Ala Val Pro Gly Met Val Trp Gin Asn Arg 
595 600 605 



Asp lie Tyr Tyr Gin Gly Pro lie Trp Ala Lys He Pro His Thr Asp 
610 615 620 
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Gly His Phe His Pro Ser Pro Leu lie Gly Gly Phe Gly Leu Lys His 
625 630 635 640 



Pro Pro Pro Gin lie Phe lie Lys Asn Thr Pro Val Pro Ala Asn Pro 
645 650 655 



Ala Thr Thr Phe Ser Ser Thr Pro Val Asn Ser Phe lie Thr Gin Tyr 
660 665 670 



Ser Thr Gly Gin Val Ser Val Gin lie Asp Trp Glu lie Gin Lys Glu 
675 680 685 



Arg Ser Lys Arg Trp Asn Pro Glu Val Gin Phe Thr Ser Asn Tyr Gly 
690 695 700 



Gin Gin Asn Ser Leu Leu Trp Ala Pro Asp Ala Ala Gly Lys Tyr Thr 
705 710 715 720 



Glu Pro Arg Ala lie Gly Thr Arg Tyr Leu Thr His His Leu 
725 730 

<210> 64 
<211> 736 
<212> PRT 

<213> capsid protein of AAV serotype, clone AAVl 

<400> 54 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 
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EP1 310 571 A2 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Zys Lys Arg 
130 135 140 



Pro Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 



Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 190 



Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie 
225 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin lie Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 
260 265 270 



Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
275 280 285 



His Cys His Phe Ser Pro Arg Asp Tarp Gin Arg Leu lie Asn Asn Asn 
290 295 300 



Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie Gin 
305 310 315 320 



Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn 
325 330 335 



Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 
340 345 350 



Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
355 360 365 



Asp Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
370 375 380 
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EP1 310 571 A2 



Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385 390 395 400 



Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
405 410 415 



Glu Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp 
420 425 430 



Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg 
435 440 445 



Thr Gin Asn Gin Ser Gly Ser Ala Gin Asn Lys Asp Leu Leu Phe Ser 
450 455 460 



Arg Gly Ser Pro Ala Gly Met Ser Val Gin Pro Lys Asn Trp Leu Pro 
465 470 475 480 



Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Lys Thr Lys Thr Asp Asn 
485 490 4S5 



Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 
500 505 510 



Gly Arg Glu Ser lie lie Asn Pro Gly Thr Ala Met Ala Ser His Lys 
515 520 525 



Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Met lie Phe Gly 
530 535 540 



Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met He 
545 550 555 560 



Thr Asp Glu Glu Glu He Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 
565 570 575 



Phe Gly Thr Val Ala Val Asn Phe Gin Ser Ser Ser Thr Asp Pro Ala 
580 585 590 



Thr Gly Asp Val His Ala Met Gly Ala Leu Pro Gly Met Val Trp Gin 
595 600 605 



Asp Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
610 615 620 



Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
525 630 535 640 
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EP1 310 571 A2 



Lys Asn Pro Pro Pro Gin lie Leu He Lys Asn Thr Pro Val Pro Ala 
645 650 655 



Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe He Thr 

660 665 670 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
675 680 685 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gin Tyr Thr Ser Asn 
690 695 700 



Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 
705 710 715 720 



Tyr Thr Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
725 730 735 

<210> 65 
<211> 736 
<212> PRT 

<213> capsid protein of AAV serotype, clone AAV6VP1 
<400> 65 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



161 



EP1 310 571 A2 



Pro Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 

145 150 155 160 



Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 

165 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 190 



Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie 
225 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin He Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 
260 265 270 



Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
275 280 285 



His cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn 
290 295 300 



Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He Gin 
305 310 315 320 



Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn 
325 330 335 



Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 
340 345 350 



Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
355 360 365 



Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
370 375 380 



Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385 390 395 400 
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EP1 310 571 A2 



Ser Gin Met Leu Axg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
405 410 415 



Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp 
420 425 430 



Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg 
435 440 445 



Thr Gin Asn Gin Ser Gly Ser Ala Gin Asn Lys Asp Leu Leu Phe Ser 
450 455 460 



Arg Gly Ser Pro Ala Gly Met Ser Val Gin Pro Lys Asn Trp Leu Pro 
465 470 475 480 



Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Lys Thr Lys Thr Asp Asn 
485 490 495 



Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 
500 505 510 



Gly Arg Glu Ser lie lie Asn Pro Gly Thr Ala Met Ala Ser His Lys 
515 520 525 



Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val I4et lie Phe Gly 
530 535 540 



Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Yal Met lie 
545 550 555 560 



Thr Asp Glu Glu Glu lie Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 
565 570 575 



Phe Gly Thr Val Ala Val Asn Leu Gin Ser Ser Ser Thr Asp Pro Ala 
580 585 590 



Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gin 
595 600 605 



Asp Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
610 615 620 



Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 640 



Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala 
645 650 655 
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EP1 310 571 A2 



Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe lie Thr 
660 665 670 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu leu Gin 
675 680 685 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gin Tyr Thr Ser Asn 
690 695 700 



Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 
705 710 715 720 



Tyr Thr Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
725 730 735 



<210> 66 
<211> 735 
<212> PRT 

<213> capsid protein of AAV serotype, clone A3 , 3 
<400> 66 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
15 10 15 



Glu Gly lie Arg Gin Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
20 25 30 



Lys Pro Asn Gin Gin His Arg Asp Asp Ser Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



His Gin Leu Lys Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Ala Val Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 
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EP1 310 571 A2 



Pro lie Glu Gin Ser Pro Ala Glu Pro Asp Ser Ser Ser Gly He Gly 
145 150 155 160 



Lys Ser Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Thr Glu Ser Val Pro Gly Pro Gin Pro He Gly Glu Pro Pro 
180 185 190 



Ala Ala Pro Ser Gly Val Gly Ser Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val He 
225 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His leu 
245 250 255 



Tyr Lys Gin He Ser Ser Glu Ser Gly Ala Thr Asn Asp Asn His Tyr 
260 265 270 



Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
275 280 285 



Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn Trp 
290 295 300 



Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn He Gin Val 
305 310 315 320 



Lys Glu Val Thr Gin Asn Asp Gly Thr Thr Thr He Ala Asn Asn Leu 
325 330 335 



Thr Ser Ala Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr 
340 345 350 



Val Leu Sly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp 
355 360 365 



Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
370 375 380 



Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385 390 395 400 
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EP1 310 571 A2 



Gin Met Leu Axg Thx Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu 
405 410 415 



Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg 
420 425 430 



Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ser Lys Thr 
435 440 445 



Gin Gly Thr Ser Gly Thr Thr Gin Gin Ser Arg Leu Gin Phe Ser Gin 
450 455 460 



Ala Gly Pro Ser Ser Met Ala Gin Gin Ala Lys Asn Trp Leu Pro Gly 
465 470 475 480 



Pro Ser Tyr Arg Gin Gin Arg Met Ser Lys Thr Ala Asn Asp Asn Asn 
485 490 495 



Asn Ser Glu Phe Ala Trp Thr Ala Ala Thr Lys Tyr Tyr Leu Asn Gly 
500 505 510 



Arg Asn Ser Leu Val Asn Pro Gly Pro Pro Val Ala Ser His Lys Asp 
515 520 525 



Asp Glu Glu Lys Tyr Phe Pro Met His Gly Asn Leu lie Phe Gly Lys 
530 535 540 



Gin Gly Thr Gly Thr Thr Asn Val Asp lie Glu Ser Val Leu lie Thr 
545 550 555 560 



Asp Glu Glu Glu lie Arg Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr 
565 570 575 



Gly Gin Val Ala Thr Asn His Gin Ser Gin Asn Thr Thr Ala Ser Tyr 
580 585 590 



Gly Ser Val Asp Ser Gin Gly lie Leu Pro Gly Met Val Trp Gin Asp 
595 600 605 



Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys Thr Pro His Thr 
610 615 620 



Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625 630 635 640 



His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asn 
645 650 655 
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EP1 310 571 A2 



Pro Ala Thr Thr Phe Thr Pro Gly Lys Phe Ala Ser Phe lie Thr Gin 
660 665 670 



Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu Leu Gin Lys 
675 680 685 



Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr Ser Asn Tyr 
690 695 700 



Asn Lys Ser Val Asn Val Glu Phe Thr Val Asp Ala Asn Gly Val Tyr 
705 710 715 720 



Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 



<210> 67 
<211> 735 
<212> PRT 

<213> capsid protein of AAV serotype, clone A3. 7 
<400> 57 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
15 10 15 



Glu Gly lie Arg Gin Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
20 25 30 



Lys Pro Asn Gin Gin His Arg Asp Asp Ser Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



His Gin Leu Lys Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Ala Val Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Gin Ser Pro Ala Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 
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EP1 310 571 A2 



Lys Ser Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Thr Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro Pro 
IBO 185 190 



Ala Ala Pro Ser Gly Val Gly Ser Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val He 
225 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn Arg Leu 
245 250 255 



Tyr Lys Gin lie Ser Ser Glu Ser Gly Ala Thr Asn Asp Asn His Tyr 
250 265 270 



Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
275 280 285 



Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp 
290 295 300 



Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn He Gin Val 
305 310 315 320 



Lys Glu Val Thr Gin Asn Asp Gly Thr Thr Thr He Ala Asn Asn Leu 
325 330 335 



Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr 
340 345 350 



Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp 
355 360 365 



Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
370 375 380 



Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385 390 395 400 



Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu 
405 410 415 
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EP1 310 571 A2 



Asp VaL Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg 
420 425 430 



Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu ser Lys Thr 
435 440 445 



Gin Gly Thr Ser Gly Thr Thr Gin Gin Ser Arg Leu Gin Phe Ser Gin 
450 455 460 



Ala Gly Pro Ser Ser Met Ala Gin Gin Ala Lys Asn Trp Leu Pro Gly 
465 470 475 480 



Pro Ser Tyr Arg Gin Gin Arg Met Ser Lys Thr Ala Asn Asp Asn Asn 
485 490 495 



Asn Ser Glu Phe Ala Trp Thr Ala Ala Thr Lys Tyr Tyr Leu Asn Gly 
500 505 510 



Arg Asn Ser Leu Val Asn Pro Gly Pro Pro Met Ala Ser His Lys Asp 
515 520 525 



Asp Glu Glu Lys Tyr Phe Pro Met His Gly Asn Leu lie Phe Gly Lys 
530 535 540 



Gin Gly Thr Gly Thr Thr Asn Val Asp lie Glu Ser Val Leu lie Thr 

545 550 555 560 



Asp Glu Glu Glu lie Arg Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr 
565 570 575 



Gly Gin Val Ala Thr Asn His Gin Ser Gin Asn Thr Thr Ala Ser Tyr 
530 585 590 



Gly Ser Val Asp Ser Gin Gly lie Leu Pro Gly Met Val Trp Gin Asp 
595 500 605 



Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys Thr Pro His Thr 
610 615 620 



Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625 630 635 640 



His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asn 
645 650 655 



Pro Ala Thr Thr Phe Thr Pro Gly Lys Phe Ala Ser Phe lie Thr Gin 
660 665 570 
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EP1 310 571 A2 



Tyr Ser Thr Gly Gin Val Ser Val Glu lie Giu Trp Glu Leu Gin Lys 
675 680 685 



Glu Asn Ser Lys Arg Trp Asn Pro Giu lie Gin Tyr Thr Ser Asn Tyr 
690 695 700 



Asn Lys Ser Val Asn Val Giu Phe Thr Val Asp Ala Asn Gly Val Tyr 

705 710 715 720 



Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 



<210> 68 
<211> 735 
<212> PRT 

<213> capsid protein of AAV serotype, clone A3 . 4 
<'400> 68 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
15 10 15 

Glu Gly lie Arg Gin Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
20 25 30 



Lys Pro Asn Gin Gin His Arg Asp Asp Ser Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



His Gin Leu Lys Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Ala Val Lys Thr Ala Pro Gly lys Lys Arg 
130 135 140 



Pro lie Glu Gin Ser Pro Ala Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 



Glu Ser Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 
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EP1 310 571 A2 



Gly Asp Thr Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro 
180 185 190 



Ala Ala Pro Ser Gly Val Gly Ser Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asp Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val lie 
225 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin He Ser Ser Glu Scr Gly Ala Thr Asn Asp Asn His Tyr 
260 265 270 



Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
275 280 285 



Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn Trp 
290 295 300 



Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn He Gin Val 
305 310 315 320 



Lys Glu Val Thr Gin Asn Asp Gly Thr Thr Thr He Ala Asn Asn Leu 
325 330 335 



Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr 
340 345 350 



Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp 
355 360 365 



Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
370 375 380 



Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385 390 395 400 



Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu 
405 410 415 



Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg 
420 425 430 
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EP1 310 571 A2 



Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ser Lys Thr 
435 440 445 



Gin Gly Thr ser Gly Thr Thr Gin Gin Ser Arg Leu Gin Phe Ser Gin 
450 455 460 



Ala Gly Pro Ser Ser Met Ala Gin Gin Ala Lys Asn Trp Leu Pro Gly 
465 470 475 480 



Pro Ser Tyr Arg Gin Gin Arg Met Ser Lys Thr Ala Asn Asp Asn Asn 
485 490 495 



Asn Ser Glu Phe Ala Trp Thr Ala Ala Thr Lys Tyr Tyr Leu Asn Gly 
500 505 510 



Arg Asn Ser Leu Val Asn Pro Gly Pro Pro Met Ala Sor His Lys Asp 
515 520 525 



Asp Glu Glu Lys Tyr Phe Pro Met His Gly Asn Leu lie Phe Gly Lys 
530 535 540 



Gin Gly Thr Gly Thr Thr Asn Val Asp lie Glu Ser Val Leu He Thr 
545 550 555 560 



Asp Glu Glu Glu lie Arg Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr 

565 570 575 



Gly Gin val Ala Thr Asn His Gin Ser Gin Asp Thr Thr Ala Ser Tyr 
580 585 590 



Gly Ser Val Asp Ser Gin Gly He Leu Pro Gly Met Val Trp Gin Asp 
595 600 605 



Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys Thr Pro His Thr 
610 615 620 



Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625 630 635 640 



His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asn 
645 650 655 



Pro Ala Thr Thr Phe Thr Pro Gly Lys Phe Ala Ser Phe He Thr Gin 
660 665 670 



Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys 
675 680 685 
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EP1 310 571 A2 



Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr Ser Asn Tyr 
690 695 700 



Asn Lys Ser Val Asn Val Glu Phe Thr Val Asp Ala Asn Gly Val Tyr 

705 710 715 720 



Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 

<210> 69 
<211> 735 
<212> PRT 

<213> capsid protein of AAV serotype, clone A3 . 5 
<400> 69 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
15 10 15 



Glu Gly lie Arg Gin Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
20 25 30 



Lys Pro Asn Gin Gin His Arg Asp Asp Ser Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



His Gin Leu Lys Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Ala Val Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Gin Ser Pro Ala Glu Pro Asp Sar Ser Ser Gly He Gly 
145 150 155 160 



Lys Ser Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Thr Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro Pro 
180 185 190 
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EP1 310 571 A2 



Ala Ala Pro Ser Gly Val Gly Ser Asn Thr Met Ala Ser Gly Gly Gly 
195 200 203 



Ala Fro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val He 
225 230 235 240 



Tlir Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin lie Ser Ser Glu Ser Gly Ala Thr Asn Asp Asn His Tyr 
260 265 270 



Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
275 280 285 



Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn Trp 
290 295 300 



Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn He Gin Val 
305 310 315 320 



Lys Glu Val Thr Gin Asn Asp Gly Thr Thr Thr He Ala Asn Asn Leu 
325 330 335 



Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr 
340 345 350 



Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp 
355 360 365 



Vdl Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
370 375 380 



Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385 390 395 400 



Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu 
405 410 415 



Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg 
420 425 430 



Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ser Lys Thr 
435 440 445 



174 



EP1 310 571 A2 



Gin Gly Thr Ser Gly Thr Thr Gin Gin ser Arg Leu Gin Phe Asn Gin 
450 455 460 



Ala Gly Pro Ser Ser Met Ala Gin Gin Ala Lys Asn Trp Leu Pro Gly 
465 470 475 480 



Pro Ser Tyr Arg Gin Gin Arg Met Ser Lys Thr Ala Asn Asp Asn Asn 
485 490 495 



Asn Ser Glu Phe Als Trp Thr Ala Ala Thr Lys Tyr Tyr Pro Asn Gly 
500 505 510 



Arg Asn Ser Leu Val Asn Pro Gly Pro Pro Met Ala Ser His Lys Asp 
515 520 525 



Asp Glu Glu Lys Tyr Phe Pro Met His Gly Asn Leu lie Phe Gly Lys 
530 535 540 



Gin Gly Thr Gly Thr Thr Asn Val Asp lie Glu Ser Val Leu He Thr 
545 550 555 560 



Asp Glu Glu Glu He Arg Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr 
565 570 575 



Gly Gin Val Ala Thr Asn Arg Gin Ser Gin Asn Thr Thr Ala Ser Tyr 
580 585 590 



Gly Ser Val Asp Ser Gin Gly He Leu Pro Gly Met Val Trp Gin Asp 
595 600 605 



Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys Thr Pro His Thr 
610 615 620 



Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625 630 635 640 



His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asn 
645 650 655 



Pro Ala Thr Thr Phe Thr Pro Gly Lys Phe Ala Ser Phe He Thr Gin 
660 665 670 



Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys 
675 680 685 



Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr 
690 695 700 



175 



EP1 310 571 A2 



Asn Lys Ser Val Asn Val Glu Phe Thr Val Asp Ala Asn Gly Val Tyr 
705 710 715 720 



Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Xeu Thr Arg Asn Leu 
725 730 735 



<210> 70 
<211> 735 
<212> PRT 

<213> capsid protein of AAV serotype, clone AAV2 
<400> 70 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
15 10 15 



Glu Gly lie Arg Gin Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
20 25 30 



Lys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Arg Gin Leu Asp Ser Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Pro Val Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu His Ser Pro Val Glu Pro Asp Ser Ser Ser Gly Thr Gly 
145 150 155 160 



Lys Ala Gly Gin Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ala Asp Ser Val Pro Asp Pro Gin Pro Leu Gly Gin Pro Pro 
180 185 190 



176 



EP1 310 571 A2 



Ala Ala Pro Ser Gly Leu Gly Thr Asn Thr Met Ala Thr Gly Ser Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val lie 
225 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin lie Ser Ser Gin Ser Gly Ala Ser Asn Asp Asn His Tyr 
260 265 270 



Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
275 280 285 



Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp 
290 295 300 



Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He Gin Val 
305 310 315 320 



Lys Glu Val Thr Gin Asn Asp Gly Thr Thr Thr He Ala Asn Asn Leu 
325 330 335 



Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr 
340 345 350 



Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp 
355 360 365 



Val Phe Met Val Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
370 375 380 



Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385 390 395 400 



Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu 
405 410 415 



Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg 
420 425 430 



Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ser Arg Thr 
435 440 445 
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EP1 310 571 A2 



Asn Thr Pro ser Gly Thr Thr Thr Gin Ser Arg Leu Gin Phe Ser Gin 
450 455 460 



Ala Gly Ala Ser Asp lie Arg Asp Gin Ser Arg Asn Trp Leu Pro Gly 
465 470 475 480 



Pro Cys Tyr Arg Gin Gin Arg Val Ser Lys Thr Ser Ala Asp Asn Asn 
485 490 495 



Asn Ser Glu Tyr Ser Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly 
500 505 510 



Arg Asp Ser Leu Val Asn Pro Gly Pro Ala Met Ala Ser His Lys Asp 
515 520 525 



Asp Glu Glu Lys Phe Phe Pro Gin Ser Gly Val Leu lie Phe Gly Lys 
530 535 540 



Gin Gly Ser Glu Lys Thr Asn Val Asp lie Glu Lys Val Met lie Thr 
545 550 555 560 



Asp Glu Glu Glu lie Arg Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr 
565 570 575 



Gly Ser Val Ser Thr Asn Leu Gin Arg Gly Asn Arg Gin Ala Ala Thr 
580 585 590 



Ala Asp Val Asn Thr Gin Gly Val Leu Pro Gly Met Val Trp Gin Asp 
595 600 605 



Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His Thr 
610 615 620 



Asp Gly His Phe His Pro ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625 630 635 640 



His Pro Pro Pro Gin lie Leu He Lys Asn Thr Pro Val Pro Ala Asn 
645 650 655 



Pro Ser Thr Thr Phe Ser Ala Ala Lys Phe Ala Ser Phe He Thr Gin 
660 665 670 



Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys 
675 680 685 



Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr 
690 695 700 
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EP1 310 571 A2 



Asn Lys Ser Val Asn Val Asp Phe Thr Val Asp Thr Asn Gly Val Tyr 
705 710 715 720 



Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 

<210> 71 
<211> 736 
<212> PRT 

<213> capsid protein of AAV serotype, clone AAV3 
<400> 71 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Ala Leu Lys Pro Gly Val Pro Gin Pro 
20 25 30 



Lys Ala Asn Gin Gin His Gin Asp Asn Arg Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg lie Leu Glu Pro 
115 120 ^ 125 



Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Gly 
130 135 140 



Ala val Asp Gin ser Pro Gin Glu Pro Asp Ser Ser Ser Gly Val Gly 
145 150 155 160 



Lys Ser Gly Lys Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 190 



Ala Ala Pro Thr Ser Leu Gly Ser Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 
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EP1 310 571 A2 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Gin Trp Leu Gly Asp Arg Val He 
225 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin He Ser Ser Gin Ser Gly Ala Ser Asn Asp Asn His Tyr 
260 265 270 



Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
275 280 285 



Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn Trp 
290 295 300 



Gly Phe Arg Pro Lys Lys Leu Ser Phe Lys Leu Phe Asn He Gin Val 
305 310 315 320 



Arg Gly Val Thr Gin Asn Asp Gly Thr Thr Thr He Ala Asn Asn Leu 
325 330 335 



Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr 
340 345 350 



Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp 
355 360 365 



Val Phe Met Val Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
370 375 380 



Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385 390 395 400 



Gin Met Leu Arg Thr Gly Asn Asn Phe Gin Phe Ser Tyr Thr Phe Glu 
405 410 415 



Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg 
420 425 430 



Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg Thr 
435 440 445 



Gin Gly Thr Thr Ser Gly Thr Thr Asn Gin Ser Arg Leu Leu Phe Ser 
450 455 460 



180 



EP1 310 571 A2 



Gin Ala Gly Pro Gin Ser Met Ser Leu Gin Ala Arg Asn Trp Leu Pro 
465 470 475 480 



Gly Pro Cys Tyr Arg Gin Gin Arg Leu Ser Lys Thr Ala Asn Asp Asn 
485 490 495 



Asn Asn Ser Asn Phe Pro Trp Thr Ala Ala Ser Lys Tyr His Leu Asn 
500 505 510 



Gly Arg Asp Ser Leu Val Asn Pro Gly Pro Ala Met Ala Ser His Lys 
515 520 525 



Asp Asp Glu Glu Lys Phe Phe Pro Met His Gly Asn Leu He Phe Gly 
530 535 540 



Lys Glu Gly Thr Thr Ala Ser Asn Ala Glu Leu Asp Asn Val Met He 
545 550 555 560 



Thr Asp Glu Glu Glu He Arg Thr Thr Asn Pro Val Ala Thr Glu Gin 
565 570 575 



Tyr Gly Thr Val Ala Asn Asn Leu Gin Ser Ser Asn Thr Ala Pro Thr 
580 585 590 



Thr Gly Thr Val Asn His Gin Gly Ala Leu Pro Gly Met Val Trp Gin 
595 600 605 



Asp Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
610 615 620 



Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 640 



Lys His Pro Pro Pro Gin He Met He Lys Asn Thr Pro Val Pro Ala 
645 650 655 



Asn Pro Pro Thr Thr Phe Ser Pro Ala Lys Phe Ala Ser Phe He Thr 
660 665 670 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
675 680 685 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
690 695 700 



Tyr Asn Lys Ser Val Asn Val Asp Phe Thr Val Asp Thr Asn Gly Val 
705 710 715 720 
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EP1 310 571 A2 



Tyr Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 



<210> 72 
<211> 737 
<212> PRT 

<213> capsid protein of AAV serotype, clone 3.3bVPl 
<400> 72 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asn Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Asn Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Ala Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 " 160 



Gly Lys Lys Gly Gin Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro 
180 185 190 



Pro Ala Ala Pro Ser Ser Val Gly Ser Gly Thr Val Ala Ala Gly Gly 
195 200 205 



182 



EP1 310 571 A2 



Gly Ala Pro Met Ala Asp Asn Asn Gla Gly Ala Asp Gly Val Gly Asn 
210 215 220 



Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 



lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Glu Gin lie Ser Ser Glu Thr Ala Gly Ser Thr Asn Asp Asn 
260 265 270 



Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
275 280 285 



Phe His Cys His Phe Ser Pro Arg Asp Trp Sin Arg Leu lie Asn Asn 
290 295 300 



Asn Trp Gly Phe Arg Pro Lys Lys Leu Arg Phe Lys Leu Phe Asn lie 
305 310 315 320 



Gin Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn 
325 330 335 



Asn Leu Thr Ser Thr lie Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu 
340 345 350 



Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro 
355 350 365 



Ala Asp Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn 
370 375 380 



Giy Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385 390 395 400 



Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Ser 
405 410 415 



Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu 
420 425 430 



Asp Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala 
435 440 445 



Arg Thr Gin Ser Asp Pro Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin 
450 455 460 



183 



EP1 310 571 A2 



Phe Tyr Gin Gly Gly Pro Ser Thr Met Ala Glu Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pre Gly Pro Cys Phe Arg Gin Gin Arg Val Ser Lys Thr Leu Asp 
485 490 495 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 



Leu Asn Gly Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Asp Arg Phe Phe Pro Ser Ser Gly Val Leu lie 
530 535 540 



Phe Gly Lys Thr Gly Ala Thr Asn Lys Thr Thr Leu Glu Asn Val Leu 
545 550 555 560 



Met Thr Asn Glu Glu Glu lie Arg Pre Thr Asn Pro Val Ala Thr Glu 
565 570 575 



Glu Tyr Gly lie Val Ser Ser Asn Leu Gin Ala Ala Asn Thr Ala Ala 
580 585 590 



Gin Thr Gin Val Val Asn Asn Gin Gly Ala Leu Pro Gly Met Val Trp 
595 600 605 



Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro 
610 615 620 



His Thr Asp Gly Asn Phe His Pro Ser Pro Xeu Met Gly Gly Phe Gly 
625 630 635 640 



Leu Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val Pro 
645 650 655 



Ala Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala Ser Phe lie 
660 665 670 



Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu Leu 
675 680 685 



Gin Lys Glu Asn Ser Lys Arg Trp Asp Pro Glu lie Gin Tyr Thr Ser 
690 695 700 



Asn Phe Glu Lys Gin Thr Gly Val Asp Phe Ala Val Asp Ser Gin Gly 
705 710 715 720 
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EP1 310 571 A2 



Val Tyr Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn 
725 730 735 



Leu 



<210> 73 
<211> 644 
<212> PRT 

<213> capsid protein of AAV serotype, clone 223-4 
<400> 73 

Lys Ala Tyr Asp Gin. Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg 
15 10 15 



Tyr Asn His Ala Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr 
20 25 30 



Ser Phe Gly Gly Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg 
35 40 45 



Val Leu Glu Pro Leu Gly Leu Val Glu Thr Pro Ala Lys Thr Ala Pro 
50 55 60 



Gly Lys Lys Arg Pro Val Asp Ser Pro Asp Ser Thr Ser Gly lie Gly 
65 70 75 80 



Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
85 90 95 



Gly Asp Ser Glu Pro Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro 
100 105 110 



Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly 
115 120 125 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
130 135 140 



Ser Gly Asn Trp His Cys Asp Ser Thr Arg Leu Gly Asp Arg Val lie 
145 150 155 160 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
165 170 175 



Tyr Lys Gin lie Ser Ser Gin Ser Ala Gly Ser Thr Asn Asp Asn Val 
180 185 190 



Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
195 200 205 



185 



EP1 310 571 A2 



His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn 
210 215 220 



Trp Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn He Gin 
225 230 235 240 



Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn 
245 250 255 



Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 
260 265 270 



Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
275 280 285 



Asp Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
290 295 300 



Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
305 310 315 320 



Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
325 330 335 



Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Gly 
340 345 350 



Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg 
355 360 365 



Thr Gin Ser Asn Ala Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin Phe 
370 375 380 



Tyr Gin Gly Gly Pro Thr Thr Met Ala Glu Gin Ala Lys Asn Trp Leu 
385 390 395 400 



Pro Gly Pro Cys Phe Arg Gin Gin Arg Val Ser Lys Thr Leu Asp Gin 
405 410 415 



Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu 
420 425 430 



Asn Gly Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His 
435 440 445 



Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu He Phe 
450 455 460 



186 



EP1 310 571 A2 



GLy Lys Thr Gly Ala Ala Asn Lys Thr Thr Leu Glu Asn Val Leu Met 
465 470 475 480 



Thr Asn Glu Glu Glu lie Arg Pro Thr Asn Pro Val Ala Thr Glu Glu 
485 490 495 



Tyr Gly lie val ser Ser Asn Leu Gin Ala Ala ser Thr Ala Ala Gin 
500 505 510 



Thr Gin Val Val Asn Asn Gin Gly Ala Leu Pro Gly Met Val Trp Gin 
515 520 525 



Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His 
530 535 540 



Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
545 550 555 560 



Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala 
565 570 575 



Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala Ser Phe He Thr 
580 585 590 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu Leu Gin 
595 600 605 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr Ser Asn 
610 615 620 



Phe Asp Lys Gin Thr Gly Val Asp Phe Ala Val Asp Ser Gin Gly Val 
625 630 635 640 



Tyr Ser Glu Pro 



<210> 74 
<211> 644 
<212> PRT 

<213> capsid protein of AAV serotype, clone 223,5 
<400> 74 

Lys Ala Tyr Asp Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg 
15 10 15 



Tyr Asn His Ala Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr 
20 25 30 



Ser Phe Gly Gly Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg 
35 40 45 



187 



EP1 310 571 A2 



Val Leu Glu Pro Leu Gly Leu Val Glu Thr Pro Ala Lys Thr Ala Pro 
50 55 60 



Gly Lys Lys Arg Pro Val Asp Ser Pro Asp Ser Thr Ser Gly lie Gly 
65 70 75 80 



Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
85 90 95 



Gly Asp Ser Glu Pro Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro 
100 105 110 



Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly 
115 120 125 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
130 135 140 



Ser Gly Asn Trp His Cys Asp Ser Thr Arg Leu Gly Asp Arg Val lie 
145 150 155 160 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
165 170 175 



Tyr Lys Gin lie Ser Ser Gin Ser Ala Gly Ser Thr Asn Asp Asn Val 
180 185 190 



Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
195 200 205 



His cys His Phe ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn 
210 215 220 



Trp Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn He Gin 
225 230 235 240 



Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn 
245 250 255 



Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 
260 265 270 



Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
275 280 285 



Asp Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
290 295 300 
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EP1 310 571 A2 



Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
305 310 315 320 



Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
325 330 335 



Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Gly 
340 345 350 



Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg 
355 360 365 



Thr Gin Ser Asn Ala Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin Phe 
370 375 380 



Tyr Gin Gly Gly Pro Thr Thr Met Ala Glu Gin Ala Lys Asn Trp Leu 
385 390 395 400 



Pro Gly Pro Cys Phe Arg Gin Gin Arg Val Ser Lys Thr Leu Asp Gin 
405 410 415 



Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu 
420 425 430 



Asn Gly Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His 
435 440 445 



Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu lie Phe 
450 455 460 



Gly Lys Thr Gly Ala Ala Asn Lys Thr Thr Leu Glu Asn Val Leu Met 
465 470 475 480 



Thr Asn Glu Glu Glu lie Arg Pro Thr Asn Pro Val Ala Thr Glu Glu 
485 490 495 



Tyr Gly lie Val Ser ser Asn Leu Gin Ala Ala Ser Thr Ala Ala Gin 
500 505 510 



Thr Gin Val Val Asn Asn Gin Gly Ala Leu Pro Gly Met Val Trp Gin 
515 520 525 



Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His 
530 535 540 



Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
545 550 555 560 



189 



EP1 310 571 A2 



Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val Pro Ala 
565 570 575 



Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala Ser Phe He Thr 
580 585 590 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
595 600 605 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
610 615 620 



Phe Asp Lys Gin Thr Gly Val Asp Phe Ala Val Asp Ser Gin Gly Val 
625 630 635 640 



Tyr Ser Glu Pro 



<210> 75 

<211> 644 

<212> PRT 

<213> capsid protein of AAV serotype, clone 223.10 
<220> 

< 2 2 1 > MI S C_FE ATURE 

<222> (434) . . (434) 

<22 3> can be any amino acid 

<400> 75 

Lys Ala Tyr Asp Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg 
15 10 15 



Tyr Asn His Ala Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr 
20 25 30 



Ser Phe Gly Gly Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg 
35 40 45 



Val Leu Glu Pro Leu Gly Leu Val Glu Thr Pro Ala Lys Thr Ala Pro 
50 55 60 



Gly Lys Lys Arg Pro Val Asp Ser Pro Asp Ser Thr Ser Gly He Gly 
65 70 75 60 



Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
85 90 95 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro Pro 
100 105 110 



190 



EP1 310 571 A2 



Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly 
115 120 125 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
130 135 140 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie 
145 150 155 160 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
165 170 175 



Tyr Lys Gin lie Ser Ser Gin Ser Ala Gly Ser Thr Asn Asp Asn Val 
180 185 190 



Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
195 200 205 



His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn 

210 215 220 



Trp Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn lie Gin 
225 230 235 240 



Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn 
245 250 255 



Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 
260 265 270 



Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
275 280 285 



Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
290 295 300 



Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
305 310 315 320 



Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
325 330 335 



Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp 
340 345 350 



Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg 
355 360 365 
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Thr Gin Ser Asn Ala Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin Phe 
370 375 380 



Tyr Gin Gly Gly Pro Thr Thr Met Ala Glu Gin Ala Lys Asn Trp Leu 
385 390 395 400 



Pro Gly Pro Cys Phe Arg Gin Gin Arg Val Ser Lys Thr Leu Asp Gin 
405 410 415 



Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu 
420 425 430 



Asn Xaa Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His 
435 440 445 



Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu lie Phe 
450 455 460 



Gly Lys Thr Gly Ala Ala Asn Lys Thr Thr Lsu Glu Asn Val Leu Met 
465 470 475 480 



Thr Asn Glu Glu Glu lie Arg Pro Thr Asn Pro Val Ala Thr Glu Glu 
485 490 495 



Tyr Gly lie Val Ser Ser Asn Leu Gin Ala Ala Ser Thr Ala Ala Gin 
500 505 510 



Thr Gin Val Val Asn Asn Gin Gly Ala Leu Pro Gly Met Val Trp Gin 
515 520 525 



Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His 
530 535 540 



Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
545 550 555 560 



Lys His Pro Pro Pro Gin lie Leu He Lys Asn Thr Pro Val Pro Ala 
565 570 575 



Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala Ser Phe He Thr 
580 585 590 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
595 600 605 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
610 615 620 
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Phe Asp Lys Gin Thr Gly Val Asp Phe Ala Val Asp Ser Gin Gly Val 
625 630 635 640 



Tyr Ser Glu Pro 



<210> 76 
<211> 644 
<212> PRT 

<213> capsid protein of AAV serotype, clone 223.2 
<400> 76 

Lys Ala Tyr Asp Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg 
15 10 15 



Tyr Asn His Ala Asp Ala Glu Phe Gin Glu Cys Leu Gin Glu Asp Thr 
20 25 30 



Ser Phe Gly Gly Asn leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg 
35 40 45 



Val Leu Glu Pro Leu Gly Leu Val Glu Thr Pro Ala Lys Thr Ala Pro 
50 55 60 



Gly Lys Lys Arg Pro Val Asp Ser Pro Asp Ser Thr Ser Gly lie Gly 
65 70 75 80 



Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
85 90 95 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro 
100 105 110 



Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Val Ala Gly Gly Gly 
115 120 125 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
130 135 140 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie 
145 150 155 160 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
165 170 175 



Tyr Lys Gin lie Sex Ser Gin Ser Ala Gly Ser Thr Asn Asp Asn Val 
180 185 190 



Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
195 200 205 
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EP1 310 571 A2 



His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn 
210 215 220 



Trp Gly Phe Arg Pro Lys Lys Leu Asa Phe Lys Leu Phe Asn He Gin 
225 230 235 240 



Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn 
245 250 255 



Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 
260 265 270 



Tyr Val Leu Gly ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
275 280 285 



Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
290 295 300 



Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
305 310 315 320 



Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
325 330 335 



Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp 
340 345 350 



Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg 
355 360 365 



Thr Gin Ser Asn Ala Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin Phe 
370 375 380 



Tyr Gin Gly Gly Pro Thr Thr Met Ala Glu Gin Ala Lys Asn Trp Leu 
385 390 395 400 



Pro Gly Pro Cys Phe Arg Gin Gin Arg Val Ser Lys Thr Leu Asp Gin 
405 410 415 



Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly ALa Thr Lys Tyr His Leu 
420 425 430 



Asn Gly Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His 
435 440 445 



lys Asp Asp Glu Glu Arg Phe ser Pro Ser Ser Gly Val Leu He Phe 
450 455 460 
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Gly Lys Thr Gly Ala Ala Asn Lys Thr Thr Leu Glu Asn Val Leu Met 
465 470 475 480 



Thr Asn Glu Glu Glu lie Arg Pro Thr Asn Pro Val Ala Thr Glu Glu 
485 490 495 



Tyr Gly lie Val Ser Ser Asn Leu Gin Ala Ala Ser Thr Ala Ala Gin 
500 505 510 



Thr Gin Val Val Asn Asn Gin Gly Ala Leu Pro Gly Met Val Trp Gin 
515 520 525 



Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His 
530 535 540 



Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
545 550 555 560 



Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val Pro Ala 
565 570 575 



Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala Ser Phe lie Thr 
580 585 590 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
595 600 605 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
610 615 620 



Phe Asp Lys Gin Thr Gly Val Asp Phe Ala Val Asp Ser Gin Gly Val 
625 630 635 640 



Tyr Ser Glu Pro 



<210> 77 
<211> 644 
<212> PRT 

<213> capsid protein of AAV serotype, clone 223.7 
<400> 77 

Lys Ala Tyr Asp Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg 
15 10 15 



Tyr Asn His Ala Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr 
20 25 30 
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Ser Phe Gly Gly Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg 
35 40 45 



Val Leu Glu Pro Leu Gly Leu Val Glu Thr Pro Ala Lys Thr Ala Pro 
50 55 50 



Gly Lys Lys Arg Pro Val Asp Ser Pro Asp Ser Thr Ser Gly lie Gly 
65 70 75 80 



Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
85 90 ' 95 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro 
100 105 110 



Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly 
115 120 125 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
130 135 140 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie 
145 150 155 160 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
165 170 175 



Tyr Lys Gin lie Ser Ser Gin Ser Ala Gly Ser Thr Asn Asp Asn Val 
180 185 ISO 



Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
195 200 205 



His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asm 
210 215 220 



Trp Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn lie Gin 
225 230 235 240 



Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn 
245 250 255 



Leu Thr Ser Thr Val Gin Val Phe Ser Asp Pro Glu Tyr Gin Leu Pro 
260 265 270 



Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
275 280 ' 285 
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Asp Vai Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
290 295 300 



Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
305 310 315 320 



Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
325 330 335 



Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp 
340 345 350 



Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg 
355 360 365 



Thr Gin Ser Asn Ala Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin Phe 
370 375 380 



Tyr Gin Gly Gly Pro Thr Thr Met Ala Glu Gin Ala Lys Asn Trp Leu 
385 390 395 400 



Pro Gly Pro Cys Phe Arg Sin Gin Arg Val Ser Lys Thr Leu Asp Gin 
405 410 415 



Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu 
420 425 430 



Asn Gly Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His 
435 440 445 



Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu lie Phe 
450 455 460 



Gly Lys Thr Gly Ala Ala Asn Lys Thr Thr Leu Glu Asn Val Leu Met 
465 470 475 480 



Thr Asn Glu Glu Glu lie Arg Pro Thr Asn Pro Val Ala Thr Glu Glu 
485 490 495 



Tyr Gly Xle Val Ser Ser Asn Leu Gin Ala Ala Ser Thr Ala Ala Gin 
500 505 510 



Thr Gin Val Val Asn Asn Gin Gly Ala Leu Pro Gly Met Val Trp Gin 
515 520 525 



Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His 
530 535 540 
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Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Giy Leu 
545 550 555 560 



Ly3 His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val Pro Ala 
565 570 575 



Asn Pro Pro Glu Val Phe Thr Pro Ala Lys lie Ala Ser Phe lie Thr 
580 585 590 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu Leu Gin 
595 600 605 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr Ser Asn 
610 615 620 



Phe Asp Lys Gin Thr Gly Val Asp Phe Ala Val Asp Ser Gin Gly Val 
625 630 635 640 



Tyr Ser Glu Pro 



<210> 78 

<211> 644 

<212> PRT 

<213> capsid protein of AAV serotype, clone 223.6 

<400> 78 

Lys Ala Tyr Asp Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg 
1 5 10 ~ 15 



Tyr Asn His Ala Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr 
20 25 30 



Ser Phe Gly Gly Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg 
35 40 45 



Val Leu Glu Pro Leu Gly Leu Val Glu Thr Pro Ala Lys Thr Ala Pro 
50 55 60 



Gly Lys Lys Arg Pro Val Asp Ser Pro Asp Ser Thr Ser Gly lie Gly 
65 70 75 80 



Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
85 90 95 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro 
100 105 110 



Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly 
115 120 125 
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Ala Pro Met Ala Asp Asn Ser Glu Gly Ala Asp Gly Val Gly Asn Ala 
130 135 140 



Ser Gly Asn Trp His Cys Asp ser Thr Trp Leu Gly Asp Arg Val lie 
145 150 155 160 



Thr Thr ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 

165 170 175 



Tyr Lys Gin lie Ser Ser Gin Ser Ala Gly Ser Thr Asn Asp Asn Val 
180 185 190 



Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
195 200 205 



His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn 
210 215 220 



Trp Gly Phe Arg Pro Lys lys Leu Asn Phe Lys Leu Phe Asn lie Gin 

225 230 235 240 



Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn 
245 250 255 



Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 

260 265 270 



Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
275 280 285 



Asp Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 

290 295 300 



Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
305 310 315 320 



Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
325 330 335 



Glu Asp Val Pro Phe His Ser Ser Tyr Ala Eis Ser Gin Ser Leu Asp 
340 345 350 



Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg 
355 360 365 



Thr Gin Ser Asn Ala Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin Phe 
370 375 380 
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Tyr Gin Gly Gly Pro Thr Thr Met Ala Glu Gin Ala Lys Asn Trp Leu 
385 390 395 400 



Pro Gly Pro Cys Phe Arg Gin Gin Arg Val Ser Lys Thr Leu Asp Gin 
405 410 415 



Asn Asn Asn Ser Asn Phe Ala Trp Thx Gly Ala Thr Lys Tyr His Leu 
420 425 430 



Asn Gly Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His 
435 440 445 



Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu lie Phe 
450 455 460 



Gly Lys Thr Gly Ala Ala Asn Lys Thr Thr Leu Glu Asn Val Leu Met 
465 470 475 480 



Thr Asn Glu Glu Glu He Arg Pro Thr Asn Pro Val Ala Thr Glu Glu 
485 490 495 



Tyr Gly He Val Ser Ser Asn Leu Gin Ala Ala Ser Thr Ala Ala Gin 
500 505 510 



Thr Gin Val Val Asn Asn Gin Gly Ala Leu Pro Gly Met Val Trp Gin 
515 520 525 



Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
530 535 540 



Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
545 550 555 560 



Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala 
565 570 575 



Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Leu Ala Ser Phe He Thr 
580 585 590 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
595 600 605 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
510 615 620 



Phe Asp Lys Gin Thr Gly Val Asp Phe Ala Val Asp Ser Gin Gly Val 
625 630 635 640 
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EP1 310 571 A2 



Tyr Ser Glu Pro 

5 

<210> 79 
<211> 738 
<212> PRT 

<213> capsid protein of AAV serotype, clone 44.1 

10 

<400> 79 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 

15 

Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 

25 

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 

35 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 

45 

Gly Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro 
180 185 190 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 
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EP1 310 571 A2 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 



lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 310 315 320 



lie Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 



Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 



Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 



Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 



Ser Arg Thr Gin Ser Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu 
450 455 460 
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EP1 310 571 A2 



Phe Ser Gin Ala Gly Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 



Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 535 540 



Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545 550 555 560 



Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 

565 570 575 



Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala 
580 585 590 



Pro He Val Gly Ala Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He 
610 615 620 



Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 640 



Gly Leu Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val 
645 650 655 



Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 



He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu 
675 680 685 



Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr 
690 695 700 



Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Asp 
705 710 715 720 
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EP1 310 571 A2 



Gly Thr Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 



<210> 80 
<211> 738 
<212> PRT 

<213> capsid protein of AAV serotype, clone 44.5 
<400> 80 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly He 
145 150 155 160 



Gly Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro 
180 185 190 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 
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EP1 310 571 A2 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 



lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Lys Arg Pro Asn Phe Lys Leu Phe Asn 
305 310 315 320 



He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ale 
325 330 335 



Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 



Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 



Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 



Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 



Ser Arg Thr Gin Ser Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu 
450 455 460 



205 



EP1 310 571 A2 



Phe Ser Gin Ala Gly Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 



Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 535 540 



Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545 550 555 560 



Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 



Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala 
580 585 590 



Pro lie Val Gly Ala Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie 
610 615 620 



Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 640 



Gly Leu Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val 
645 650 655 



Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 

660 665 670 



lie Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu 
675 680 685 



Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr 
690 695 700 



Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Asp 
705 710 715 720 
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EP1 310 571 A2 



Gly Thr Tyr Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 



<210> 81 
<211> 738 
<212> PRT 

<213> capsid protein of AAV serotype, clone 44.2 
<400> 81 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 



Gly Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro 
180 185 190 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 
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EP1 310 571 A2 



Giy Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 



lie Thr Thr Ser Thr Arg Thr Trp Ala leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 310 315 320 



He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 



Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Sor Tyr 
405 410 415 



Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 



Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 



Ser Arg Thr Gin Ser Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu 
450 455 460 



208 



EP1 310 571 A2 



Phe Ser Gin Ala Gly Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 



Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 535 540 



Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545 550 555 560 



Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 



Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala 
580 585 590 



Pro lie Val Gly Ala Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie 
610 615 620 



Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 640 



Gly Leu Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val 
645 650 655 



Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 



lie Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu 
675 680 685 



Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr 
690 695 700 



Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Asp 
705 710 715 720 



209 



EP1 310 571 A2 



Gly Thr Tyr Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 



<210> 82 
<211> 738 
<212> PRT 

<213> capsid protein of AAV serotype, clone 29,3VP1 
<400> 82 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 

Glu Gly lie Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 

115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Thr Thr Gly lie 
145 150 155 160 



Gly Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro 
180 185 190 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 



210 



EP1 310 571 A2 



Gly Ala Pro Met Ala Asp Asm Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 



lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 310 315 320 



He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala Arg Gin Gly Cys Leu Pro Pro Phe 
355 360 365 



Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 



Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 



Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 



Ser Arg Thr Gin Ser Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu 
450 455 460 



211 



EP1 310 571 A2 



Phe Ser Gin Ala Gly Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 



Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 535 540 



Phe Gly Lys Gin Gly Ala Gly Lys Gly Asn Val Asp Tyr Ser Ser Val 
545 550 555 560 



Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 



Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala 
580 585 590 



Pro He Val Gly Ala Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He 
610 615 620 



Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 640 



Gly Leu Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val 
645 650 655 



Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 



He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu 
675 680 685 



Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr 
690 695 700 



Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Asp 
705 710 715 720 



212 



EP1 310 571 A2 



Gly Thr Tyr Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 



<210> 83 
<211> 738 
<212> PRT 

<213> capsid protein of AAV serotype, clone 29.5VP1 
<400> 83 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 



Gly Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro 
180 185 190 



213 



EP1 310 571 A2 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Gly Val 
225 230 235 240 



He Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Lys Gin He Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
250 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Lys Ser Leu Asn Phe Lys Leu Phe Asn 
305 310 315 320 



He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 



Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 



Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 



Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 



214 



EP1 310 571 A2 



Ser Arg Thr Gin Ser Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu 
450 455 460 



Phe Ser Gin Ala Gly Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 



Gin Asn Asp Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 



Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 535 540 



Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545 550 555 560 



Met Leu Thr Ser Glu Glu Glu He Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 



Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala 
580 585 590 



Pro He Val Gly Ala Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He 
610 615 620 



Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 640 



Gly Leu Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val 
645 650 655 



Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 



He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu 
675 680 685 



Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr 
690 695 700 



215 



EP1 310 571 A2 



Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Asp 
705 710 715 720 



Gly Thr Tyr Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 



<210> 84 
<211> 738 
<212> PRT 

<213> capsid protein of AAV serotype, clone 42.15 
<400> 84 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 



Gly Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro 
180 185 190 



216 



EP1 310 571 A2 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 

195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 



lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 310 315 320 



lie Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr lie Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr lie Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Pro Pro Pro Phe 
355 360 365 



Pro Ala Asp Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 



Phe Pro Ser Gin Met Arg Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 



Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 



Leu Asp Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 



217 



EP1 310 571 A2 



Ser Arg Thr Gin Ser Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu 
450 455 460 



Phe Ser Gin Ala Gly Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 



Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 535 540 



Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545 550 555 560 



Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 



Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala 
580 585 590 



Pro lie Val Gly Ala Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie 
610 615 620 



Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 640 



Gly Leu Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val 
645 650 655 



Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 



lie Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu 
675 680 685 



Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr 
690 695 700 



218 



EP1 310 571 A2 



Ser Asn Tyr Tyr Lys Ser Tlir Asn Val Asp Phe Ala Val Asn Thr Glu 
705 710 715 720 



Gly Thr Tyr Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 



<210> 85 
<211> 738 
<212> PRT 

<213> capsid protein of AAV serotype, clone 42.8 
<400> 85 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 



Gly Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



219 



EP1 310 571 A2 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro 
180 185 190 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 



He Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Lys Gin He Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 310 315 320 



He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 



Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 



Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 



220 



EP1 310 571 A2 



Leu Asp Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 



Ser Arg Thr Gin Ser Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu 
450 455 460 



Phe Ser Gin Ala Gly Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 



Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 535 540 



Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545 550 555 560 



Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 



Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala 
580 585 590 



Pro lie Val Gly Ala Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie 
610 515 620 



Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 640 



Gly Leu Lys His Pro Pro Pro Gin lie Leu He Lys Asn Thr Pro Val 
645 650 655 



Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 



He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu 
675 680 685 



221 



EP1 310 571 A2 



Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr 
690 695 700 



Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu 
705 710 715 720 



Gly Thr Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 



<210> 86 
<211> 733 
<212> PRT 

<213> amino acid of AAV serotype, clone 42.13 

<400> 86 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro He Glu Ser Pro Asp Ser Ser Thr Gly He Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



222 



EP1 310 571 A2 



Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro Pro Ala Gly Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 

Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val He Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin He 
245 250 255 



Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn Thr Tyr Phe Gly 
260 265 270 



Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His 
275 280 285 



Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn Trp Gly Phe 
290 295 300 



Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He Gin Val Lys Glu 
305 310 315 320 



Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala Asn Asn Leu Thr Ser 
325 330 335 



Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu 
340 345 350 



Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe 
355 360 365 



Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ala 
370 375 380 



Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met 
385 390 395 400 



Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Gin Phe Glu Asp Val 
405 410 415 



Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met 
420 425 430 



223 



EP1 310 571 A2 



Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ser Arg Thr Gin Ser 
435 440 445 



Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu Phe Ser Gin Ala Gly 
450 455 460 



Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp Leu Pro Gly Pro Cys 
465 470 475 480 



Tyr Arg Gin Gin Arg Val Ser Thr Thr Val Ser Gin Asn Asn Asn Ser 
485 490 495 



Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asp 
500 505 510 



Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His Lys Gly Asp Glu 
515 520 525 



Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met Phe Gly Lys Gin Gly 
530 535 540 



Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val Met Leu Thr Ser Glu 
545 550 555 560 



Glu Glu He Lys Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr Gly Val 
565 570 575 



Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala Pro He Val Gly Ala 
580 585 590 



Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp 
595 600 605 



Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His Thr Asp Gly 
610 615 620 



Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro 
625 630 635 640 



Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asp Pro Pro 
645 650 655 



Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe He Thr Gin Tyr Ser 
660 665 670 



Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn 
675 680 685 



224 



EP1 310 571 A2 



Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr Ser Asn Tyr Tyr Lys 
690 695 700 



Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Thr Tyr Ser Glu 
705 710 715 720 



Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Ser Leu 
725 730 



<210> 87 
<211> 733 
<212> PRT 

<213> capsid protein of AAV serotype, clone 42 . 3A 
<400> 87 

Met Ala Ala Asp Gly His Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 

Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro Ala Gly Pro Ser 
180 185 190 
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EP1 310 571 A2 



Gly Leu Gly Ser &ly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val He Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin He 
245 250 255 



Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn Thr Tyr Phe Gly 
260 265 270 



Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His 
275 280 285 



Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Ser Trp Gly Phe 
290 295 300 



Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He Gin Val Lys Glu 
305 310 315 320 



Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala Asn Asn Leu Thr Ser 
325 330 335 



Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu 
340 345 350 



Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe 
355 360 365 



Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ala 
370 375 380 



Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met 
385 390 395 400 



Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Gin Phe Glu Asp Val 
405 410 415 



Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met 
420 425 430 



Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ser Arg Thr Gin Ser 
435 440 445 
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EP1 310 571 A2 



Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu Phe Ser Gin Ala Gly 
450 455 460 



Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp Leu Pro Gly Pro Cys 
465 470 475 480 



Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser Gin Asn Asn Asn Ser 
485 490 495 



Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asp 
500 505 510 



Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His Lys Asp Asp Glu 
515 520 525 



Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met Phe Gly Lys Gin Gly 
530 535 540 



Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val Met Leu Thr Ser Glu 
545 550 555 560 



Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr Gly Val 
565 570 575 



Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala Pro lie Val Gly Ala 
580 585 590 



Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp 
595 600 605 



Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His Thr Asp Gly 
610 615 620 



Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro 
625 630 635 640 



Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asp Pro Pro 
645 650 655 



Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe He Thr Gin Tyr Ser 
660 665 670 



Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn 
675 680 685 



Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr Tyr Lys 
690 695 700 
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EP1 310 571 A2 



Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Thr Tyr Ser Glu 
705 710 715 720 



Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 



<210> 88 
<211> 731 
<212> PRT 

<213> capsid protein of AAV serotype, clone 42.4 
<400> 88 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro Pro Ala Gly Pro Ser 
180 185 190 
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EP1 310 571 A2 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 
245 250 255 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
260 265 270 



Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 



Ser Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg Pro 
290 295 300 



Lys Arg Leu Asn Phe Lys Leu Phe Asn lie Gin Val Lys Glu Val Thr 
305 310 315 320 



Gin Asn Glu Gly Thr Lys Thr lie Ala Asn Asn Leu Thr Ser Thr lie 
325 330 335 



Gin Val Phe Thr Asp Ser Glu Tyr Arg Leu Pro Tyr Val Leu Gly Ser 
340 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met He 
355 360 365 



Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ala Val Gly 
370 375 380 



Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asn Asn Phe Glu Phe Ser Tyr Gin Phe Glu Asp Val Pro Phe 
405 410 415 



His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ser Arg Thr Gin Ser Thr Gly 
435 440 445 
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EP1 310 571 A2 



Gly Thr Ala Gly Thr Gin Gin Leu Leu Phe Ser Gin Ala Gly Pro Asn 
450 455 460 



Asn Met Ser Ala Gin Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg 
465 470 475 480 



Gin Gin Arg Val Ser Thr Thr Leu Ser Gin Asn Asn Asn Ser Asn Phe 
485 490 495 



Ala Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asp Ser Leu 
500 505 510 



Val Asn Pro Gly Val Ala Met Ala Thr His Lys Asp Asp Glu Glu Arg 
515 520 525 



Phe Phe Pro Ser Ser Gly Val Leu Met Phe Gly Lys Gin Gly Ala Gly 
530 535 540 



Lys Asp Asn Val Asp Tyr Ser Ser Val Met Leu Thr Ser Glu Glu Glu 
545 550 555 560 



lie Lys Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr Gly Val Val Ala 
565 570 575 



Asp Asn Leu Gin Gin Gin Asn Ala Ala Pro lie Val Gly Ala Val Asn 
580 585 590 



Ser Gin Gly Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr 
595 600 605 



Leu Gin Gly Pro lie Trp Ala Lys lie Pro His Thr Asp Gly Asn Phe 
610 615 620 



His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro 
625 630 635 640 



Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr 
645 650 655 



Phe Ser Gin Ala Lys Pro Ala Ser Phe He Thr Gin Tyr Ser Thr Gly 
660 665 670 



Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys 
675 680 685 



Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr 
690 695 700 
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Asn Val Asp Phe Ala Val Asn Thr Glu Gly Thi: Tyr Ser Glu Pro Arg 
705 710 715 720 



Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 



<210> 89 
<211> 731 
<212> PRT 

<213> capsid protein of AAV serotype, clone 42. 5A 
<400> 89 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Arg Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly He Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 
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EP1 310 571 A2 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val He Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin He 
245 250 255 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
260 265 270 



Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 



Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn Arg Gly Phe Arg Pro 
290 295 300 



Arg Lys Leu Arg Phe Lys Leu Phe Asn He Gin Val Lys Glu Val Thr 
305 310 315 320 



Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn Leu Thr Ser Thr He 
325 330 335 



Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly ser 
340 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met He 
355 360 365 

Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val Gly 
370 375 380 



Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asn Asn Phe Glu Phe Ser Tyr Gin Phe Glu Asp Val Pro Phe 
405 410 415 



His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ser Arg Thr Gin Ser Thr Gly 
435 440 445 



Gly Thr Ala Gly Thr Gin Gin Leu Leu Phe Ser Gin Ala Gly Pro Asn 
450 455 460 
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EP1 310 571 A2 



Asn Met Ser Ala Gin Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg 
465 470 475 480 



Gin Gin Arg Val Ser Thr Thr Leu Ser Gin Asn Asn Asn Ser Asn Phe 
485 490 495 



Ala Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asp Ser Leu 
500 505 510 



Val Asn Pro Gly Val Ala Met Ala Thr His Lys Asp Asp Glu Glu Arg 
515 520 525 



Phe Phe Pro Ser Ser Gly Val Leu Met Phe Gly Lys Gin Gly Ala Gly 
530 535 540 



Lys Asp Asn Val Asp Tyr Ser Ser Val Met Leu Thr Ser Glu Glu Glu 
545 550 555 560 



lie Lys Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr Gly Val Val Ala 
565 570 575 



Asp Asn Leu Gin Gin Gin Asn Ala Ala Pro lie Val Gly Ala Val Asn 
580 585 590 



Ser Gin Gly Ala Leu Pro Gly Met Ala Trp Gin Asn Arg Asp Val Tyr 
595 600 605 



Leu Gin Gly Pro lie Trp Ala Lys lie Pro His Thr Asp Gly Asn Phe 
610 615 620 



His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro 
625 630 635 640 



Gin lie Leu lie Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr 
645 650 655 



Phe Ser Gin Ala Lys Leu Ala Ser Phe lie Thr Gin Tyr Ser Thr Gly 
660 665 670 



Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys 
675 680 685 



Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr 
690 695 700 



Asn Val Asp Phe Ala Val Asn Thr Glu Gly Thr Tyr Ser Glu Pro Arg 
705 710 715 720 
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EP1 310 571 A2 



Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 



<210> 90 
<211> 733 
<212> PRT 

<213> capsid protein of AAV serotype, clone 42. IB 
<400> 90 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Arg Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro Ala Gly Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser Ser Gly Asn Trp 
210 215 220 
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EP1 310 571 A2 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 
245 250 255 



Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn Thr Tyr Phe Gly 
260 265 270 



Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His 
275 280 285 



Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe 
290 295 300 



Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie Gin Val Lys Glu 
305 310 315 320 



Val Thr Gin Asn Glu Gly Thr Lys Thr lie Ala Asn Asn Leu Thr Ser 
325 330 335 

Thr lie Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu 
340 345 350 



Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe 
355 360 365 



Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ala 
370 375 380 



Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met 
385 390 395 400 



Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Gin Phe Glu Asp Val 
405 410 415 



Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met 
420 425 430 



Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ser Arg Thr Gin Ser 
435 440 445 



Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu Phe Ser Gin Ala Gly 
450 455 460 



Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp Leu Pro Gly Pro Cys 
465 470 475 480 
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EP1 310 571 A2 



Tyr Arg Gin Gin Arg Val Ser Thr Thr Val Ser Gin Asn Asn Asn Ser 
485 490 495 



Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asp 
500 505 510 



Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His Lys Gly Asp Glu 
515 520 525 



Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met Phe Gly Lys Gin Gly 
530 535 540 



Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val Met Leu Thr Ser Glu 
545 550 555 560 



Glu Glu He Lys Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr Gly Val 
565 570 575 



Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala Pro He Val Gly Ala 
580 585 590 



Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp 
595 600 605 



Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His Thr Asp Gly 
610 615 620 



Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro 
625 630 635 640 



Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asp Pro Pro 
645 650 655 



Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe He Thr Gin Tyr Ser 
660 665 670 



Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn 
675 680 685 



Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr Tyr Lys 
690 695 700 



Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Thr Tyr Ser Glu 
705 710 715 720 



Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 
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EP1 310 571 A2 



<210> 91 
<211> 738 
<212> PRT 

<213> capsid protein of AAV serotype, clone 42 , 5B 
<400> 91 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 

15 10 15 

Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 

Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 



Gly Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro 
180 185 190 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 



He Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 
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EP1 310 571 A2 



Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Xeu lie Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 310 315 320 

lie Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr lie Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr lie Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 



Pro Ala Asp Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 



Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 



Leu Asp Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 



Ser Arg Thr Gin Ser Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu 
450 455 460 



Phe Ser Gin Ala Gly Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 
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EP1 310 571 A2 



Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 535 540 



Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545 550 555 560 



Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 



Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala 
580 585 590 



Pro He Val Gly Ala Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He 
610 615 620 



Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 640 



Gly Leu Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val 
645 650 555 



Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 



He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu 
675 680 685 



Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr 
690 695 700 



Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu 
705 710 715 720 



Gly Thr Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 

Asn Leu 



<210> 92 

<211> 738 

<212> PRT 

<213> capsid protein of 7\AV serotype, clone 43.1 
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EP1 310 571 A2 



<400> 92 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 



Gly Lys Lys Gly His Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro 
180 185 190 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 



He Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 
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EP1 310 571 A2 



Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 

275 280 285 

Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 310 315 320 



He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Pro Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 



Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 



Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 



Leu Asp Arg Leu Met Asn Pro leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 



Ser Arg Thr Gin Ser Thr Gly Gly Thr Gin Gly Thr Gin Gin Leu Leu 
450 455 460 



Phe Ser Gin Ala Gly Pro Ala Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 

485 490 495 

Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 



Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 
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His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 535 540 



Phe Gly Lys Gin Gly Ala Gly Lys Asp Asa Val Asp Tyr Ser Ser Val 
545 550 555 560 



Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 



Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Thr Asn Gly Ala 
580 585 590 



Pro lie Val Gly Thr Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie 
610 615 620 



Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 640 



Gly Leu Lys His Pro Pro Pro Gin lie L eu Val Lys Asn Thr Pro Val 
645 650 655 



Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 



lie Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu 
675 680 685 



Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr 
690 695 700 



Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu 
705 710 715 720 



Gly Thr Tyr Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 



<210> 93 

<211> 738 

<212> PRT 

<213> capsid protein of AAV serotype, clone 43.12 
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<400> 93 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 



Gly Lys Lys Gly His Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin 

165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro 
180 185 190 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 



lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 
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Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 310 315 320 



lie Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 



Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 



Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 



Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 



Ser Arg Thr Gin Ser Thr Gly Gly Thr Gin Gly Thr Gin Gin Leu Leu 
450 455 460 



Phe Ser Gin Ala Gly Pro Ala Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 
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Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 535 540 



Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545 550 555 560 



Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 



Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Thr Asn Gly Ala 
580 585 590 



Pro lie Val Gly Thr Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie 
610 615 620 



Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 640 



Gly Leu Lys His Pro Pro Pro Gin lie Leu Val Lys Asn Thr Pro Val 
645 650 655 



Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 



lie Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu 
675 680 685 



Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr 
690 695 700 



Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu 
705 710 715 720 



Gly Thr Tyr Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 



<210> 94 

<211> 738 

<212> PRT 

<213> capsid protein of AAV serotype, clone 43.5 
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<400> 94 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ale 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 

145 150 155 160 



Gly Lys Lys Gly His Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro 
180 185 190 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 

Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 



lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 
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Leu Tyr Lys Gin He Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 310 315 320 



He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 



Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 



Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 



Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 



Ser Arg Thr Gin Ser Thr Gly Gly Thr Gin Gly Thr Gin Gin Leu Leu 
450 455 460 



Phe Ser Gin Ala Gly Pro Ala Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 
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Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 535 540 



Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545 550 555 560 



Met Leu Thr Ser Glu Glu Glu He Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 



Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Thr Asn Gly Ala 
580 585 590 



Pro He Val Gly Thr Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala lys He 
610 615 620 



Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 640 



Gly Leu Lys His Pro Pro Pro Gin He Leu Val Lys Asn Thr Pro Val 
645 650 655 



Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 



He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu 
675 680 685 



Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr 
690 695 700 



Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu 
705 710 715 720 



Gly Thr Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 



<210> 95 

<211> 738 

<212> PRT 

<213> capsid protein of AAV serotype, clone AAV8 
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<400> 95 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Gin Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 



Gly Lys Lys Gly Gin Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Sly Glu Pro 
180 185 190 



Pro Ala Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ala Gly Gly 
195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 
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lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 

245 250 255 



Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ala Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Ser Phe Lys Leu Phe Asn 
305 310 315 320 



He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 



Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Gin Phe Thr Tyr 
405 410 415 



Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 



Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 



Ser Arg Thr Gin Thr Thr Gly Gly Thr Ala Asn Thr Gin Thr Leu Gly 
450 455 460 



Phe Ser Gin Gly Gly Pro Asn Thr Met Ala Asn Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Thr Gly 
485 490 495 
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Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His 
500 505 510 



Leu Asn Gly Arg Asn Ser Leu Ala Asn Pro Gly He Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Asn Gly He Leu lie 
530 535 540 



Phe Gly Lys Gin Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val 
545 550 555 560 



Met Leu Thr Ser Glu Glu Glu He Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 



Glu Glu Tyr Gly He Val Ala Asp Asn Leu Gin Gin Gin Asn Thr Ala 
580 585 590 



Pro Gin He Gly Thr Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He 
610 615 620 



Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 640 



Gly Leu Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val 

645 650 655 



Pro Ala Asp Pro Pro Thr Thr Phe Asn Gin Ser Lys Leu Asn Ser Phe 
660 665 670 



He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu 
675 680 685 



Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr 
590 695 700 



Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu 
705 710 715 720 



Gly Val Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 
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<210> 96 
<211> 736 
<212> PRT 

<213> capsid protein of AAV serotype, clone 43.21 
<400> 96 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 



Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 190 



Ala Ala Pro Ser Gly Leu Gly Pro Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 
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Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val He 
225 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn 
260 265 270 



Thr Tyr E>he Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
275 280 285 



Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn 
290 295 300 



Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He 
305 310 315 320 



Gin Val Lys Glu Val Thr Thr Asn Glu Gly Thr Lys Thr lie Ala Asn 
325 330 335 



Asn Leu Thr Ser Thr Val Arg Val Phe Thr Asp Ser Glu Tyr Gin Leu 
340 345 350 



Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro 
355 360 365 



Ala Asp Val Phe Met Val Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn 
370 375 ^ 380 



Gly Ser Gin Ala Leu Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385 390 395 400 



Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Gin Phe Ser Tyr Thr 
405 410 415 



Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu 
420 425 430 



Asp Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Val 
435 440 445 



Arg Thr Gin Thr Thr Gly Thr Gly Gly Thr Gin Thr Leu Ala Phe Ser 
450 455 460 



Gin Ala Gly Pro Ser Ser Met Ala Asn Gin Ala Arg Asn Trp Val Pro 
465 470 475 480 
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Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Thr Asn Gin Ser 

485 490 495 



Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Ala Lys Phe Lys leu Asn 
500 505 510 



Gly Arg Asp Ser Leu Met Asn Pro Gly Val Ala Met Ala Ser His Lys 
515 520 525 



Asp Asp Asp Asp Arg Phe Phe Pro Ser Ser Gly Val Leu lie Phe Gly 
530 535 540 



Lys Gin Gly Ala Gly Asn Asp Gly Val Asp Tyr Ser Gin Val Leu lie 
545 550 555 560 



Thr Asp Glu Glu Glu He Lys Ala Thr Asn Pro Val Ala Thr Glu Glu 
565 570 575 



Tyr Gly Ala Val Ala He Asn Asn Gin Ala Ala Asn Thr Gin Ala Gin 
580 585 590 



Thr Gly Leu Val His Asn Gin Gly Val He Pro Gly Met Val Trp Gin 
595 600 605 



Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
610 615 620 



Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 640 



Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala 
645 650 655 



Asp Pro Pro Leu Thr Phe Asn Gin Ala Lys Leu Asn Ser Phe He Thr 
660 665 670 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
675 680 685 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
690 695 700 



Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705 710 715 720 



Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 
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<210> 97 
<211> 736 
<212> PRT 

<213> capsid protein of AAV serotype, clone 43,25 
<400> 97 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 

85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 

100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 



Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 190 



Ala Ala Pro Ser Gly Leu Gly Pro Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie 
225 230 235 240 
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Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn 
260 265 270 



Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
275 280 285 



Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn 
290 295 300 



Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie 
305 310 315 320 



Gin Val Lys Glu Val Thr Thr Asn Glu Gly Thr Lys Thr He Ala Asn 
325 330 335 



Asn Leu Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu 
340 345 350 



Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro 
355 360 365 



Ala Asp Val Phe Met Val Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn 
370 375 380 



Gly Ser Gin Ala Leu Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385 390 395 400 



Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Gin Phe Ser Tyr Thr 
405 410 415 



Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu 
420 425 430 



Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Val 
435 440 445 



Arg Thr Gin Thr Thr Gly Thr Gly Gly Thr Gin Thr Leu Ala Phe Ser 
450 455 460 



Gin Ala Gly Pro Ser Ser Met Ala Asn Gin Ala Arg Asn Trp Val Pro 
465 470 475 480 



Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Thr Asn Gin Asn 
485 490 495 
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Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Ala Lys Phe Lys Leu Asn 
500 505 510 



Gly Arg Asp Ser Leu Met Asn Pro Gly Val Ala Met Ala Ser His Lys 
515 520 525 



Asp Asp Asp Asp Arg Phe Phe Pro Ser Ser Gly Val Leu lie Phe Gly 
530 535 540 



Lys Gin Gly Ala Gly Asn Asp Gly Val Asp Tyr Ser Gin Val Leu lie 
545 550 555 560 



Thr Asp Glu Glu Glu lie Lys Ala Thr Asn Pro Val Ala Thr Glu Glu 
565 570 575 



Tyr Gly Ala Val Ala lie Asn Asn Gin Ala Ala Asn Thr Gin Ala Gin 
580 585 590 



Thr Gly Leu Val His Asn Gin Gly Val He Pro Gly Met Val Trp Gin 
595 600 605 



Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
610 615 620 



Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 640 



Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala 
645 650 655 



Asp Pro Pro Leu Thr Phe Asn Gin Ala Lys Leu Asn Ser Phe He Thr 
660 665 670 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
675 680 685 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
690 695 700 



Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705 710 715 720 



Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 
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<210> 98 
<211> 736 
<212> PRT 

<213> capsid protein of AAV serotype, clone 43.23 
<400> 98 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 



Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 190 



Ala Ala Pro Ser Gly Leu Gly Pro Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie 
225 230 235 240 
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Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin He Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn 
260 265 270 



Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
275 280 285 



Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn 
290 295 300 



Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He 
305 310 315 320 



Gin Val Lys Glu Val Thr Thr Asn Glu Gly Thr Lys Thr He Ala Asn 
325 330 335 



Asn Leu Thr Ser Thr Val Gin Val Phe Thr Asp Leu Glu Tyr Gin Leu 
340 345 350 



Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro 
355 360 365 



Ala Asp Val Phe Met Val Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn 
370 375 ^ 380 



Gly Ser Gin Ala Leu Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385 390 395 400 



Pro Ser Gin Met Pro Arg Thr Gly Asn Asn Phe Gin Phe Ser Tyr Thr 
405 410 415 



Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu 
420 425 430 



Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Val 
435 440 445 



Arg Thr Gin Thr Thr Gly Thr Gly Gly Thr Gin Thr Leu Ala Phe Ser 
450 455 460 



Gin Ala Gly Pro Ser Ser Met Ala Asn Gin Ala Arg Asn Trp Val Pro 
465 470 415 480 



Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Thr Asn Gin Asn 
485 490 495 
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Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Ala Lys Phe Lys Leu Asn 
500 505 510 



Gly Arg Asp Ser Leu Met Asn Pro Gly Val Ala Met Ala Ser His Lys 
515 520 525 



Asp Asp Asp Asp Arg Phe Phe Pro Ser Ser Gly Val Leu lie Phe Gly 
530 535 540 



Lys Gin Gly Ala Gly Asn Asp Gly Val Asp Tyr Ser Gin Val Leu lie 
545 550 555 560 



Thr Asp Glu Glu Glu lie Lys Ala Thr Asn Pro Val Ala Thr Glu Glu 
565 570 575 



Tyr Gly Ala Val Ala lie Asn Asn Gin Ala Ala Asn Thr Gin Ala Gin 
580 585 590 



Thr Gly Leu Val His Asn Gin Gly Val lie Pro Gly Met Val Trp Gin 
595 600 605 



Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His 
610 615 620 



Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 640 



Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val Pro Ala 
645 650 655 



Asp Pro Pro Leu Thr Phe Asn Gin Ala Lys Leu Asn Ser Phe lie Thr 
660 665 670 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu Leu Gin 
675 680 685 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr Ser Asn 
690 695 700 



Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705 710 715 720 



Tyr Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 
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<210> 99 
<211> 736 
<212> PRT 

<213> capsid protein of AAV serotype, clone 43.20 
<400> 99 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 30 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Leu Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 



Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 190 



Ala Ala Pro Ser Gly Leu Gly Pro Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie 
225 230 235 240 
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Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn 
260 265 270 

Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
275 280 285 



Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn 
290 295 300 



Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie 
305 310 315 320 



Gin Val Lys Glu Val Thr Thr Asn Glu Gly Thr Lys Thr lie Ala Asn 
325 330 335 



Asn Leu Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu 
340 345 350 



Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro 
355 360 365 



Ala Asp Val Phe Thr Val Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn 
370 375 380 



Gly Ser Gin Ala Leu Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385 390 395 400 



Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Gin Phe Ser Tyr Thr 
405 410 415 



Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu 
420 425 430 



Asp Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Val 
435 440 445 



Arg Thr Gin Thr Thr Gly Thr Gly Gly Thr Gin Thr Leu Ala Phe Ser 
450 455 460 



Gin Ala Gly Pro Ser Ser Met Ala Asn Gin Ala Arg Asn Trp Val Pro 
465 470 475 480 



Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Thr Asn Gin Asn 
485 490 495 
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Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Ala Lys Phe Lys Leu Asn 
500 505 510 



Gly Arg Asp Ser Leu Met Asn Pro Gly Val Ala Met Ala Ser His Lys 
515 520 525 



Asp Asp Asp Asp Arg Phe Phe Pro Ser Ser Gly Val Leu lie Phe Gly 
530 535 540 



Lys Gin Gly Ala Gly Asn Asp Gly Val Asp Tyr Ser Gin Val Leu lie 
545 550 555 560 



Thr Asp Glu Glu Glu lie Lys Ala Thr Asn Pro Val Ala Thr Glu Glu 
565 570 575 



Tyr Gly Ala Val Ala lie Asn Asn Gin Ala Ala Asn Thr Gin Ala Gin 
580 585 590 



Thr Gly Leu Val His Asn Gin Gly Val lie Pro Gly Met Val Trp Gin 
595 600 605 



Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His 
610 615 620 



Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 640 



Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val Pro Ala 
645 650 655 



Asp Pro Pro Leu Thr Phe Asn Gin Ala Lys Leu Asn Ser Phe lie Thr 
660 665 670 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu Leu Gin 
675 680 685 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr Ser Asn 
690 695 700 



Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705 710 715 720 



Tyr Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 



<210> 100 

<211> 736 

<212> PRT 

<213> caps id protein of AAV serotype, clone AAV9 
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400> 100 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 



Lys Ser Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 190 



Glu Ala Pro Ser Gly Leu Gly Pro Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie 
225 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



264 



EP1 310 571 A2 



Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn 

260 265 270 



Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn. Arg 
275 280 285 



Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn 
290 295 300 



Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie 
305 310 315 320 



Gin Val Lys Glu Val Thr Thr Asn Glu Gly Thr Lys Thr lie Ala Asn 
325 330 335 



Asn Leu Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu 
340 345 350 



Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro 
355 360 365 



Ala Asp Val Phe Met Val Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn 
370 375 380 



Gly Ser Gin Ala Leu Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385 390 395 400 



Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Gin Phe Ser Tyr Thr 
405 410 415 



Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu 
420 425 430 



Asp Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Val 
435 440 445 



Arg Thr Gin Thr Thr Gly Thr Gly Gly Thr Gin Thr Leu Ala Phe Ser 
450 455 460 



Gin Ala Gly Pro Ser Ser Met Ala Asn Gin Ala Arg Asn Trp Val Pro 
465 470 475 480 



Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Thr Asn Gin Asn 
485 490 495 



Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Ala Lys Phe Lys Leu Asn 
500 505 510 
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Gly Arg Asp Ser Leu Met Asn Pro Gly Val Ala Met Ala Ser His Lys 
515 520 525 



Asp Asp Glu Asp Arg Phe Phe Pro Ser Ser Gly Val Leu He Phe Gly 
530 535 540 



Lys Gin Gly Ala Gly Asn Asp Gly Val Asp Tyr Ser Gin Val Leu lie 
545 550 555 560 



Thr Asp Glu Glu Glu He Lys Ala Thr Asn Pro Val Ala Thr Glu Glu 
565 570 575 



Tyr Gly Ala Val Ala He Asn Asn Gin Ala Ala Asn Thr Gin Ala Gin 
580 585 590 



Thr Gly Leu Val His Asn Gin Gly Val He Pro Gly Met Val Trp Gin 
595 600 605 



Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
610 615 620 



Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 640 



Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala 
645 650 655 



Asp Pro Pro Leu Thr Phe Asn Gin Ala Lys Leu Asn Ser Phe He Thr 
660 665 670 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
675 680 685 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
690 695 700 



Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705 710 715 720 



Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 



<210> 101 
<211> 728 
<212> PRT 

<213> capsid protein of AAV serotype, clone 24.1 

<400> 101 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 
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Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Arg Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Val Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 
245 250 255 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Ser Tyr Ser 
260 265 270 
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Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 



Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg Pro 
290 295 300 



Arg Lys Leu Arg Phe Lys Leu Phe Asn lie Gin Val Lys Glu Val Thr 
305 310 315 320 



Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn Leu Thr Ser Thr lie 
325 330 335 



Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser 
340 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met lie 
355 360 365 



Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val Gly 
370 375 380 



Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asn Asn Phe Glu Phe Ser Tyr Thr Phe Glu Glu Val Pro Phe 
405 410 415 



His Ser Ser Tyr Val His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr Thr 
435 440 445 



Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr Met 
450 455 460 



Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin Gin 
465 470 475 480 



Arg Leu Ser Lys Asn He Asp Ser Asn Asn Asn Ser Asn Phe Ala Trp 
485 490 495 



Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr Asn 
500 505 510 



Pro Gly Val Ala Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe Phe 

515 520 525 
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Pro lie Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn Lys 
530 535 540 



Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu lie Lys Thr 
545 550 555 560 



Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn Leu 
565 570 575 



Gin Ser Ser Thr Ala Gly Pro Gin Thr Gin Thr Val Asn Ser Gin Gly 
580 585 590 



Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Cys Leu Gin Gly 
595 600 605 



Pro lie Trp Ala Lys lie Pro His Thr Asp Gly Asn Phe His Pro Ser 
610 615 620 



Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin lie Leu 
625 630 635 640 



lie Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr Pro 
645 650 655 



Ala Lys Phe Ala Ser Phe He Thr Gin Tyr Ser Thr Gly Gin Val Ser 
660 665 670 



Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn 
675 680 685 



Pro Glu He Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val Glu 
690 695 700 



Phe Ala Val Asn Asn Glu Gly Val Tyr Thr Glu Pro Arg Pro He Gly 
705 710 715 720 



Thr Arg Tyr Leu Thr Arg Asn Leu 

725 



<210> 102 
<211> 728 
<212> PRT 

<213> capsid protein of T^V serotype, clone 42.2REAL 
<400> 102 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 
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Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 

130 135 140 



Pro He Glu Ser Pro Asp Ser Ser Thr Gly He Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val He Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin He 
245 250 255 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
260 265 270 
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Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 



Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg Pro 
290 295 300 



Arg Lys Leu Arg Phe Lys Leu Phe Asn lie Gin Val Lys Glu Val Thr 
305 310 315 320 



Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn Leu Thr Ser Thr He 
325 330 335 



Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser 
340 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met He 
355 360 365 



Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val Gly 
370 375 380 



Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asn Asn Phe Glu Phe Ser Tyr Thr Phe Glu Glu Val Pro Phe 
405 410 415 



His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr Thr 
435 440 445 



Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr Met 
450 455 460 



Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin Gin 
465 470 475 480 



Arg Leu Ser Lys Asn He Asp Ser Asn Asn Asn Ser Asn Phe Ala Trp 
485 490 495 



Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr Asn 
500 505 510 



Pro Gly Val Ala Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe Phe 
515 520 525 
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Pro He Asn Gly Val Leu Vai Phe Gly Glu Thr Gly Ala Ala Asn Lys 
530 535 540 



Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu He Lys Thr 
545 550 555 560 



Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn Leu 

565 570 575 



Gin Ser Ser Thr Ala Gly Pro Gin Thr Gin Thr Val Asn Ser Gin Gly 
580 585 590 



Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly 
595 600 605 



Pro He Trp Ala Lys He Pro His Thr Asp Gly Asn Phe His Pro Ser 
610 615 620 



Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin He Leu 
625 630 635 640 



He Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr Pro 
645 650 655 



Ala Lys Phe Ala Ser Phe He Thr Gin Tyr Ser Thr Gly Gin Val Ser 
660 665 570 



Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn 
675 680 685 



Pro Glu He Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val Glu 
690 695 700 



Phe Ala Val Asn Asn Glu Gly Val Tyr Thr Glu Pro Arg Pro He Gly 
705 71C 715 720 



Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 103 
<211> 728 
<212> PRT 

<213> capsid protein of AAV serotype, clone 7.2VP1 
<400> 103 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Gly Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 
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Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Arg Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Asn Gly Gin 
145 150 155 160 



Pro Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 
245 250 255 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
260 265 270 



Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 
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Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg Pro 
290 295 300 



Arg Lys Leu Arg Phe Lys Leu Phe Asn lie Gin Val Lys Glu Val Thr 
305 310 315 320 



Thr Asn Asp Gly Val Thr Thr lie Al^ Asn Asn Leu Thr Ser Thr lie 
325 330 335 



Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser 
340 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met lie 
355 360 365 



Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val Gly 
370 375 380 



Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asp Asn Phe Glu Phe Ser Tyr Thr Phe Glu Glu Val Pro Phe 
405 410 415 



His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr Thr 
435 440 445 



Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr Met 
450 455 460 



Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin Gin 
465 470 475 480 



Arg Leu Ser Lys Asn lie Asp Ser Asn Asn Asn Ser Asn Phe Ala Trp 
485 490 495 



Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr Asn 
500 505 510 



Pro Gly Val Ala Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe Phe 
515 520 525 



Pro He Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn Lys 
530 535 540 
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Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu lie Lys Thr 
545 550 555 560 



Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn Leu 
565 570 575 



Gin ser Ser Thr Ala Gly Pro Gin Thr Gin Thr Val Asn Ser Gin Gly 
580 585 590 



Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly 
595 600 605 



Pro lie Trp Ala Lys lie Pro His Thr Asp Gly Asn Phe His Pro Ser 
610 615 620 



Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin lie Leu 
625 630 635 640 



lie Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr Pro 
645 650 655 



Ala Lys Phe Ala Ser Phe lie Thr Gin Tyr Ser Thr Gly Gin Val Ser 
660 665 670 



Val Glu lie Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn 
675 680 685 



Pro Glu lie Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val Glu 
690 695 700 



Phe Ala Val Asn Asn Glu Gly Val Tyr Thr Glu Pro Arg Pro He Gly 
705 710 715 720 



Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 104 
<211> 728 
<212> PRT 

<213> capsid protein of AAV serotype, clone 27 . 3VP1 
<400> 104 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 
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EP1 310 571 A2 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Ser Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 
245 250 255 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
260 265 270 



Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 
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EP1 310 571 A2 



Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg Pro 

290 295 300 



Arg Lys Leu Arg Phe Lys Leu Phe Asn lie Gin Val Lys Glu Val Thr 
305 310 315 320 



Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn Leu Thr Ser Thr lie 
325 330 335 



Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser 
340 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met lie 
355 360 365 



Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val Gly 
370 375 380 



Arg Ser Ser Phe Cys Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asn Asn Phe Glu Phe Ser Tyr Thr Phe Glu Glu Val Pro Phe 
405 410 415 



His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr Thr 
435 440 445 



Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr Val 
450 455 460 



Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin Gin 
465 470 475 480 



Arg Leu Ser Lys Asn lie Asp Ser Asn Asn Asn Ser Asn Phe Ala Trp 
485 490 495 



Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr Asn 
500 505 510 



Pro Gly Val Ala Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe Leu 
515 520 525 



Pro lie Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn Lys 
530 535 540 
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EP1 310 571 A2 



Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu He Lys Thr 
545 550 555 560 



Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn Leu 
565 570 575 



Gin Ser Ser Thr Ala Gly Pro Arg Thr Gin Thr Val Asn Ser Gin Gly 
580 585 590 



Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly 
595 600 605 



Pro He Trp Ala Glu He Pro His Thr Asp Gly Asn Phe His Pro Ser 
610 615 620 



Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin He Leu 
625 630 635 640 



He Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr Pro 
645 650 655 



Ala Lys Phe Ala Ser Phe He Thr Gin Tyr Ser Thr Gly Gin Val Ser 
660 665 670 



Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn 
675 680 685 



Pro Glu He Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val Glu 
690 695 700 



Phe Ala Val Asn Asn Glu Gly Val Tyr Thr Glu Pro Arg Pro He Gly 
705 710 715 720 



Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 105 
<211> 728 
<212> PRT 

<213> capsid protein of AAV serotype, clone 16.3VP1 
<400> 105 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 
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EP1 310 571 A2 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 
245 250 255 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
260 265 270 



Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 



Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg Pro 
290 295 300 
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EP1 310 571 A2 



Arg Lys Leu Arg Phe Lys Leu Phe Asn lie Gin Val Lys Glu Val Thr 
305 310 315 320 



Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn Leu Thr Ser Thr He 
325 330 335 



Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser 
340 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met He 
355 360 365 



Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Met Gly 
370 375 380 



Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asn Asn Phe Glu Phe Ser Tyr Thr Phe Glu Glu Val Pro Phe 
405 410 415 



His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr Thr 
435 440 445 



Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr Met 
450 455 460 



Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin Gin 
465 470 475 480 



Arg Leu Ser Lys Asn He Asp Ser Asn Asn Asn Ser Asn Phe Ala Trp 
485 490 495 



Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr Asn 
500 505 510 



Pro Gly Val Ala Met Ala Thr Asn Lys Asp Asp Glu Gly Gin Phe Phe 
515 520 525 



Pro He Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn Lys 
530 535 540 



Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu He Lys Thr 
545 550 555 560 
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EP1 310 571 A2 



Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn Leu 
565 570 575 



Gin Ser Ser Thr Ala Gly Pro Gin Thr Gin Thr Val Asn Ser Gin Gly 
580 585 590 



Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly 
595 600 605 



Pro lie Trp Ala Lys lie Pro His Thr Asp Gly Asn Phe His Pro Ser 
610 615 620 



Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin lie Leu 
625 630 635 640 



lie Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Gly Val Phe Thr Pro 
645 650 655 



Ala Leu Phe Ala Ser Phe He Thr Gin Tyr Ser Thr Gly Gin Val Ser 
660 665 670 



Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn 
675 680 685 



Pro Glu He Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val Glu 
690 695 700 



Phe Ala Val Asn Asn Glu Gly Val Tyr Thr Glu Pro Arg Pro He Gly 
705 710 715 720 



Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 106 
<211> 728 
<212> PRT 

<213> capsid protein of AAV serotype, clone 42.10 
<400> 106 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 
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EP1 310 571 A2 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Giy Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Arg Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro Ala Gly Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 
245 250 255 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
260 265 270 



Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 



Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg Pro 
290 295 300 
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EP1 310 571 A2 



Arg Lys Leu Arg Phe Lys Leu Phe Asn lie Gin Val Lys Glu Val Thr 
305 310 315 320 



Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn Leu Thr Ser Thr lie 
325 330 335 



Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser 
340 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met lie 
355 360 365 



Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val Gly 
370 375 380 



Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asn Asn Phe Glu Phe Ser Tyr Thr Phe Glu Glu Val Pro Phe 
405 410 415 



His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr Thr 
435 440 445 



Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr Met 
450 455 460 



Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin Gin 
465 470 475 480 



Arg Leu Ser Lys Asn He Asp Ser Asn Asn Asn Ser Asn Phe Ala Trp 
485 490 495 



Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr Asn 
500 505 510 



Pro Gly Val Ala Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe Phe 
515 520 525 



Pro He Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn Lys 
530 535 540 



Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu He Lys Thr 
545 550 555 560 
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EP1 310 571 A2 



Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn Leu 
565 570 575 



Gin Ser Ser Thr Ala Gly Pro Gin Thr Gin Thr Val Asn Ser Gin Gly 
580 585 590 



Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly 
595 600 605 



Pro lie Trp Ala Lys lie Pro His Thr Asp Gly Asn Phe His Pro Ser 
610 615 620 



Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin lie Leu 
625 630 635 640 



lie Lys Asn Thr Pro val Pro Ala Asn Pro Pro Glu Val Phe Thr Pro 
645 650 655 



Ala Lys Phe Ala Ser Phe lie Thr Gin Tyr Ser Thr Gly Gin Val Ser 
660 665 670 



Val Glu lie Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn 
675 680 685 



Pro Glu lie Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val Glu 
690 695 700 



Phe Ala Val Asn Asn Glu Gly Val Tyr Thr Glu Pro Arg Pro lie Gly 
705 710 715 720 



Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 107 
<211> 728 
<212> PRT 

<213> capsid protein of AAV serotype^ clone 42. 3B 
<400> 107 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 
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EP1 310 571 A2 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro Ala Gly Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 
245 250 255 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
260 265 270 



Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 



Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg Pro 
290 295 300 



Arg Lys Leu Arg Phe Lys Leu Phe Asn lie Gin Val Lys Glu Val Thr 
305 310 315 320 
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EP1 310 571 A2 



Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn Leu Thr Ser Thr lie 
325 330 335 



Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser 
340 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met He 
355 360 365 



Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val Gly 
370 375 380 



Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asn Asn Phe Glu Phe Ser Tyr Thr Phe Glu Glu Val Pro Phe 
405 410 415 



His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr Thr 
435 440 445 



Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr Met 
450 455 460 



Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin Gin 
465 470 475 480 



Arg Leu Ser Lys Asn He Asp Ser Asn Asn Thr Ser Asn Phe Ala Trp 
485 490 495 



Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr Asn 
500 505 510 



Pro Gly Val Ala Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe Phe 
515 520 525 



Pro He Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn Lys 
530 535 540 



Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu He Lys Thr 
545 550 555 560 



Thr Asn Pro Val Ala Thr Glu Gin Tyr Gly Val Val Ser Ser Asn Leu 
565 570 575 
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EP1 310 571 A2 



Gin Ser Ser Thr Aia Gly Pro Gin Thr Gin Thr Val Asn Ser Gin Gly 
580 585 590 



Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly 
595 600 605 



Pro lie Trp Ala Lys lie Pro His Thr Asp Gly Asn Phe His Pro Ser 
610 615 620 



Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin lie Leu 
625 630 635 640 



lie Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr Pro 
645 650 655 



Ala Lys Phe Ala Ser Phe lie Thr Gin Tyr Ser Thr Gly Gin Val Ser 
660 665 670 



Val Glu lie Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn 
675 680 685 



Pro Glu lie Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val Glu 
690 695 700 



Phe Ala Val Asn Asn Glu Gly Val Tyr Thr Glu Pro Arg Pro lie Gly 
705 710 715 720 



Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 108 
<211> 728 
<212> PRT 

<213> capsid protein of AAV serotype, clone 42.11 
<400> 108 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 
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EP1 310 571 A2 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro Ala Gly Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 
245 250 255 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
260 265 270 



Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 



Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg Pro 
290 295 300 



Arg Lys Leu Arg Phe Lys Leu Phe Asn lie Gin Val Lys Glu Val Thr 
305 310 315 320 
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EP1 310 571 A2 



Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn Leu Thr Ser Thr He 

325 330 335 



Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser 
340 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met lie 
355 360 365 



Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val Gly 
370 375 380 



Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asn Asn Phe Glu Phe Ser Tyr Thr Phe Glu Glu Val Pro Phe 
405 410 415 



His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr Thr 
435 440 445 



Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr Met 
450 455 460 



Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Arg Gin 
465 470 475 480 



Arg Leu Ser Lys Asp He Asp Ser Asn Asn Asn Ser Asn Phe Ala Trp 
485 490 495 



Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr Asn 
500 505 510 



Pro Gly Val Ala Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe Phe 
515 520 525 



Pro He Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn Lys 
530 535 540 



Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu He Lys Thr 
545 550 555 560 



Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn Leu 
565 570 575 
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EP1 310 571 A2 



Gin Ser Ser Thr Ala Gly Pro Gin Thr Gin Thr Val Asn Ser Gin Gly 
580 585 590 



Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly 
595 600 605 



Pro lie Trp Ala Lys lie Pro His Thr Asp Gly Asn Phe His Pro Ser 
610 615 620 



Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin lie Leu 
625 630 635 640 



lie Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr Pro 
645 650 655 



Ala Lys Phe Ala Ser Phe lie Thr Gin Tyr Ser Thr Gly Gin Val Ser 
660 665 670 



Val Glu lie Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn 
675 680 685 



Pro Glu lie Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val Glu 
690 695 700 



Phe Ala Val Asn Asn Glu Gly Val Tyr Thr Glu Pro Arg Pro lie Gly 
705 710 715 720 



Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 109 
<211> 729 

<212> PRT 

<213> capsid protein of AAV serotype, clone FlVPl 
<400> 109 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 

Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 

20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 
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EP1 310 571 A2 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Asp Ser Pro Asp Ser Ser Thr Gly lie Gly Lys lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 



Ser Val Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 

245 250 255 



Ser Ser Ser Ser Ser Gly Ala Thr Asn Asp Asn His Tyr Phe Gly Tyr 
260 265 270 



Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe 
275 280 285 



Ser Pro Arg Asp Trp Gin Arg Leu' lie Asn Asn Asn Trp Gly Phe Arg 
290 295 300 



Pro Lys Lys Leu Arg Phe Lys Leu Phe Asn lie Gin Val Lys Glu Val 
305 310 315 320 



Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn Leu Thr Ser Thr 
325 330 335 
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Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly 
340 345 350 



Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met 
355 360 355 



lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val 
370 375 380 



Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu 
385 390 395 400 



TVrg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Ser Phe Glu Asp Val Pro 
405 410 415 



Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn 
420 425 430 



Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr 
435 440 445 



Thr Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr 
450 455 460 



Met Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin 
455 470 475 480 



Gin Gly Leu Ser Lys Asn Leu Asp Phe Asn Asn Asn Ser Asn Phe Ala 
485 490 495 



Trp Thr Ala Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr 
500 505 510 



Asn Pro Gly He Pro Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe 
515 520 525 



Phe Pro He Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn 
530 535 540 



Lys Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu He Lys 
545 550 555 560 



Thr Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn 
565 570 575 



Leu Gin Pro Ser Thr Ala Gly Pro Gin Ser Gin Thr He Asn Ser Gin 
580 585 5S0 
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Gly Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin 
595 600 605 



Gly Pro He Trp Ala Lys lie Pro His Thr Asp Gly Asn Phe His Pro 
610 615 620 



Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin He 
625 630 635 640 



Leu He Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr 
645 650 655 



Pro Ala Lys Phe Ala Ser Phe He Thr Gin Tyr Ser Thr Gly Gin Val 
660 665 670 



ser Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp 
675 680 685 



Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val 
690 695 700 



Glu Phe Ala Val Asn Pro Asp Gly Val Tyr Thr Glu Pro Arg Pro He 
705 710 715 720 



Gly Thr Arg Tyr Leu Pro Arg Asn Leu 
725 



<210> 110 
<211> 729 
<212> PRT 

<213> capsid protein of AAV serotype, clone F5VP103 
<400> 110 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 
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Asp Ala Glu PhG Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Asp Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 



Ser Val Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Thr Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 
245 250 255 



Ser Ser Ser Ser Ser Gly Ala Thr Asn Asp Asn His Tyr Phe Gly Tyr 
260 265 270 



Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe 
275 280 285 



Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg 
290 295 300 



Pro Lys Lys Leu Arg Phe Lys Leu Phe Asn lie Gin Val Lys Glu Val 
305 310 315 320 



Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn Leu Thr Ser Thr 
325 330 335 



Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly 
340 345 350 
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Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met 
355 360 365 



He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val 
370 375 380 



Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu 
385 390 395 400 



Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Ser Phe Glu Asp Val Pro 
405 410 415 



Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn 
420 425 430 



Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr 
435 440 445 



Thr Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr 
450 455 460 



Met Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin 
465 470 475 480 



Gin Arg Leu Ser Lys Asn Leu Asp Phe Asn Asn Asn Ser Asn Phe Ala 
485 490 495 



Trp Thr Ala Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr 
500 505 510 



Asn Pro Gly He Pro Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe 
515 520 525 



Phe Pro He Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn 
530 535 540 



Lys Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu He Lys 
545 550 555 560 



Thr Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn 
565 570 575 



Leu Gin Ser Ser Thr Ala Gly Pro Gin Ser Gin Thr He Asn Ser Gin 
580 585 590 



Gly Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin 
595 600 605 
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Gly Pro lie Trp Ala Lys lie Pro His Thr Asp Gly Asn Phe His Pro 
610 615 620 



Ser Pro Leu Met Gly Gly Phe Gly Leu Glu His Pro Pro Pro Gin lie 
625 630 635 640 



Leu lie Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr 
645 650 655 



Pro Ala Lys Phe Ala Ser Phe He Thr Gin Tyr Ser Thr Gly Gin Val 
660 665 670 



Ser Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp 
675 680 685 



Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val 
690 695 700 



Glu Phe Ala Val Asn Pro Asp Gly Val Tyr Thr Glu Pro Arg Pro He 
705 710 715 720 



Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 111 
<211> 729 
<212> PRT 

<213> capsid protein of AAV serotype, clone F3VP1 
<400> 111 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 
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Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Gly Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 



Ser Val Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 

245 250 255 



Ser Ser Ser Ser Ser Gly Ala Thr Asn Asp Asn His Tyr Phe Gly Tyr 
260 265 270 



Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe 
275 280 285 



Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg 
290 295 300 



Pro Lys Lys Leu Arg Phe Lys Leu Leu Asn lie Gin Val Lys Glu Val 
305 310 315 320 



Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn Leu Thr Ser Thr 
325 330 335 



Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly 
340 345 350 
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Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met 
355 360 365 



lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asp Asn Gly Ser Gin Ser Val 
370 375 380 



Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu 
385 390 395 400 



Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Ser Phe Glu Asp Val Pro 
405 410 415 



Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn 
420 425 430 



Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr 
435 440 445 



Thr Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr 
450 455 460 



Met Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin 
465 470 475 480 



Gin Arg Leu Ser Lys Asn Leu Asp Phe Asn Asn Asn Ser Asn Phe Ala 
485 490 495 



Trp Thr Ala Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr 

500 505 510 



Asn Pro Gly lie Pro Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe 
515 520 525 



Phe Pro lie Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn 
530 535 540 



Lys Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu lie Lys 
545 550 555 560 



Thr Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn 
565 570 575 



Leu Gin Ser Ser Thr Ala Gly Pro Gin Ser Gin Thr lie Asn Ser Gin 
580 585 590 



Gly Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin 
595 600 605 
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Gly Pro lie Trp Ala Lys He Pro His Thr Asp Gly Asn Phe His Pro 
610 615 620 



Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin He 
625 630 635 640 



Leu He Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr 
645 650 655 



Pro Ala Lys Phe Ala Ser Phe He Thr Gin Tyr Ser Thr Gly Gin Val 
660 665 670 



Ser Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp 
675 680 685 



Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val 
690 6S5 700 



Glu Phe Ala Val Asn Pro Asp Gly Val Tyr Thr Glu Pro Arg Pro He 
705 710 715 720 



Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 112 
<211> 735 
<212> PRT 

<213> capsid protein of AAV serotype, clone 42 . 6B 
<400> 112 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 
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Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 



Gly Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro 
180 185 190 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 



He Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Lys Gin He Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Arg Lys Leu Arg Phe Lys Leu Phe Asn 
305 310 315 320 



He Gin Val Lys Glu Val Thr Thr Asp Asp Gly Val Thr Thr He Ala 
325 330 335 



Asn Asn Leu Thr ser Thr He Gin Val Phe Ser Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 
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Pro Ala Asp Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 



Asn Gly Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 



Thr Phe Glu Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 



Leu Asp Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 



Ala Arg Thr Gin Ser Thr Thr Gly Ser Thr Arg Glu Leu Gin Phe His 
450 455 460 



Gin Ala Gly Pro Asn Thr Met Ala Glu Gin Ser Lys Asn Trp Leu Pro 
465 470 475 480 



Gly Pro Cys Tyr Arg Gin Gin Arg Leu Ser Lys Asn lie Asp Ser Asn 
485 490 495 



Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu Asn 
500 505 510 



Gly Arg Asn Ser Leu Thr Asn Pro Gly Val Ala Met Ala Thr Asn Lys 
515 520 525 



Asp Asp Glu Asp Gin Phe Phe Pro lie Asn Gly Val Leu Val Phe Gly 
530 535 540 



Lys Thr Gly Ala Ala Asn Lys Thr Thr Leu Glu Asn Val Leu Met Thr 
545 550 555 560 



Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr Glu Glu Tyr 

565 570 575 



Gly Val Val Ser Ser Asn Leu Gin Ser Ser Thr Ala Gly Pro Gin Thr 
580 585 590 



Gin Thr Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val Trp Gin Asn 
595 600 605 



Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys He Pro His Thr 
610 615 620 



301 



EP1 310 571 A2 



Asp Gly Asn Phe His Pro Ser Pro Leu Met Asp Gly Phe Gly Leu Lys 
625 630 635 640 



His Pro Pro Pro Gin lie Leu He Lys Asn Thr Pro Val Pro Ala Asn 
645 650 655 



Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala Ser Phe He Thr Gin 
660 665 670 



Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys 
675 680 685 



Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr 
690 6S5 700 



Ala Lys Ser Asn Asn Val Glu Phe Ala Val Asn Asn Glu Gly Val Tyr 
705 710 715 720 



Thr Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 



<210> 113 
<211> 685 
<212> PRT 

<213> capsid protein of AAV serotype, clone 42.12 
<400> 113 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 
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Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 



Gly Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro 
180 185 190 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 



lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 310 315 320 



lie Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr lie Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr lie Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 



303 



EP1 310 571 A2 



Pro Ala Asp Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 



Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 



Leu Asp Arg Leu Thr Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 



Ala Arg Thr Gin Ser Thr Thr Gly Ser Thr Arg Gly Leu Gin Phe His 
450 455 460 



Gin Ala Gly Pro Asn Thr Met Ala Glu Gin Ser Lys Asn Trp Leu Pro 
465 470 475 480 



Gly Pro Cys Tyr Arg Gin Gin Arg Leu Ser Lys Asn lie Asp Ser Asn 
485 490 495 



Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu Asn 

500 505 510 



Gly Arg Asn Ser Leu Thr Asn Pro Gly Val Ala Met Ala Thr Asn Lys 
515 520 525 



Asp Asp Glu Asp Gin Phe Phe Pro lie Asn Gly Val Leu Val Phe Gly 
530 535 540 



Lys Thr Gly Ala Ala Asn Lys Thr Thr Leu Glu Asn Val Leu Met Thr 
545 550 555 560 



Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr Glu Glu Tyr 
565 570 575 



Gly Val Val Ser Ser Asn Leu Gin Ser Ser Thr Ala Gly Pro Gin Thr 
580 585 590 



Gin Thr Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val Trp Gin Asn 
595 600 605 



Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His Thr 
610 615 620 
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Asp Gly Asn Ehe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625 630 635 640 



His Pro Pro Pro Gin lie Leu lie Lys Tyr Thr Ser Asn Tyr Tyr Lys 
645 650 655 



Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Thr Tyr Ser Glu 
660 665 670 



Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
675 680 685 



<210> 114 
<211> 724 
<212> PRT 

<213> capsid protein of AAV serotype, clone AAV5CAP 

<400> 114 

Met Ser Phe Val Asp His Pro Pro Asp Trp Leu Glu Glu Val Gly Glu 
15 10 15 



Gly Leu Arg Glu Phe Leu Gly Leu Glu Ala Gly Pro Pro Lys Pro Lys 
20 25 30 



Pro Asn Gin Gin His Gin Asp Gin Ala Arg Gly Leu Val Leu Pro Gly 
35 40 45 



Tyr Asn Tyr Leu Gly Pro Gly Asn Gly Leu Asp Arg Gly Glu Pro Val 
50 55 60 



Asn Arg Ala Asp Glu Val Ala Arg Glu His Asp lie Ser Tyr Asn Glu 
65 70 75 80 



Gin Leu Glu Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala Asp 
85 90 95 



Ala Glu Phe Gin Glu Lys Leu Ala Asp Asp Thr Ser Phe Gly Gly Asn 
100 105 110 



Leu Gly Lys Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro Phe 
115 120 125 



Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Thr Gly Lys Arg He 
130 135 140 



Asp Asp His Phe Pro Lys Arg Lys Lys Ala Arg Thr Glu Glu Asp Ser 
145 150 155 160 



Lys Pro Ser Thr Ser Ser Asp Ala Glu Ala Gly Pro Ser Gly Ser Gin 
165 170 175 
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10 



35 



40 



Gin Leu Gin lie Pro Ala Gin Pro Ala Ser Ser Leu Gly Ala Asp Thr 
180 185 190 



Met Ser Ala Gly Gly Gly Gly Pro Leu Gly Asp Asn Asn Gin Gly Ala 
195 200 205 



Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys Asp Ser Thr Trp 
210 215 220 



Met Gly Asp Arg Val Val Thr Lys Ser Thr Arg Thr Trp Val Leu Pro 
15 225 230 235 240 

Ser Tyr Asn Asn His Gin Tyr Arg Glu lie Lys Ser Gly Ser Val Asp 
245 250 255 

20 

Gly Ser Asn Ala Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
260 265 270 

Phe Asp Phe Asn Arg Phe His Ser His Tip Ser Pro Arg Asp Trp Gin 
25 275 280 285 

Arg Leu lie Asn Asn Tyr Trp Gly Phe Arg Pro Arg Ser Leu Arg val 
290 295 300 

30 

Lys lie Phe Asn lie Gin Val Lys Glu Val Thr Val Gin Asp Ser Thr 
305 310 315 320 

Thr Thr lie Ala Asn Asn Leu Thr Ser Thr Val Gin Val Phe Thr Asp 
325 330 335 

Asp Asp Tyr Gin Leu Pro Tyr Val Val Gly Asn Gly Thr Glu Gly Cys 
340 345 350 

leu Pro Ala Phe Pro Pro Gin Val Phe Thr Leu Pro Gin Tyr Gly Tyr 
355 360 365 

Ala Thr Leu Asn Arg Asp Asn Thr Glu Asn Pro Thr Glu Arg Ser Ser 
370 375 380 

Phe Phe Cys Leu Glu Tyr Phe Pro Ser Lys Met Leu Arg Thr Gly Asn 
385 390 395 400 

50 

Asn Phe Glu Phe Thr Tyr Asn Phe Glu Glu Val Pro Phe Kis Ser Ser 
405 410 415 

55 Phe Ala Pro Ser Gin Asn Leu Phe Lys Leu Ala Asn Pro Leu Val Asp 

420 425 430 
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Gin Tyr Leu Tyr Arg Phe Val Ser Thr Asn Asn Thr Gly Gly Val Gin 
435 440 445 



Phe Asn Lys Asn Leu Ala Gly Arg Tyr Ala Asn Thr Tyr Lys Asn Trp 
450 455 460 



Phe Pro Gly Pro Met Gly Arg Thr Gin Gly Trp Asn Leu Gly Ser Gly 
465 470 475 480 



Val Asn Arg Ala Ser Val Ser Ala Phe Ala Thr Thr Asn Arg Met Glu 
485 490 495 



Leu Glu Gly Ala Ser Tyr Gin Val Pro Pro Gin Pro Asn Gly Met Thr 
500 505 510 



Asn Asn Leu Gin Gly Ser Asn Thr Tyr Ala Leu Glu Asn Thr Met lie 
515 520 525 



Phe Asn Ser Gin Pro Ala Asn Pro Gly Thr Thr Ala Thr Tyr Leu Glu 
530 535 540 



Gly Asn Met Leu lie Thr Ser Glu Ser Glu Thr Gin Pro Val Asn Arg 
545 550 555 560 



Val Ala Tyr Asn Val Gly Gly Gin Met Ala Thr Asn Asn Gin Ser Ser 
565 570 575 



Thr Thr Ala Pro Ala Thr Gly Thr Tyr Asn Leu Gin Glu lie Val Pro 
580 585 590 



Gly Ser Val Trp Met Glu Arg Asp Val Tyr Leu Gin Gly Pro lie Trp 
595 600 605 



Ala Lys lie Pro Glu Thr Gly Ala His Phe His Pro Ser Pro Ala Met 
610 615 620 



Gly Gly Phe Gly Leu Lys His Pro Pro Pro Met Met Leu lie Lys Asn 
625 630 635 640 



Thr Pro Val Pro Gly Asn lie Thr Ser Phe Ser Asp Val Pro Val Ser 
645 650 655 



Ser Phe lie Thr Gin Tyr Ser Thr Gly Gin Val Thr Val Glu Met Glu 
660 665 670 



Trp Glu Leu Lys Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin 
675 680 685 
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10 



30 



35 



40 



45 
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Tyr Thr Asn Asn Tyr Asn Asp Pro Gin Phe Val Asp Phe Ala Pro Asp 
690 695 700 



Ser Thr Gly Glu Tyr Arg Thr Thr Arg Pro lie Gly Thr Arg Tyr Leu 
705 710 715 720 



Thr Arg Pro Leu 



<210> 115 
15 <211> 9 



<212> DNA 

<213> Drain restriction enzyme site 



<400> 115 
20 caccacgtc 



<210> 116 

<211> 28 

<212> DNA 

25 <213> AV2cas 



55 



<400> 116 

cgcagagacc aaagttcaac tgaaacga 28 



<210> 117 
<211> 255 
<212> DNA 

<213> adeno-associated virus serotype 10 

<400> 117 

ggtaattcct ccggaaattg gcattgcgat tccacatggc tgggcgacag agtcatcacc 60 
accagcaccc gaacctgggt cctgcccacc tacaacaacc acatctacaa gcaaatctcc 120 
agcgagacag gagccaccaa cgacaaccac tacttcggct acagcacccc ctgggggtat 180 
tttgacttta acagattcca ctgccacttt tcaccacgtg actggcagcg actcatcaac 240 
aacaactggg gattc 255 



<210> 118 
<211> 258 
<212> DNA 

<213> adeno-associated virus serotype 11 
<400> 118 

ggtaattcct ccggaaattg gcattgcgat tccacatggc tgggcgacag agtcatcacc 60 
accagcaccc gaacctgggc cctgccaacc tacaacaacc acctctacaa acaaatctcc 120 
agcgcttcaa cgggggccag caacgacaac cactactttg gctacagcac cccctggggg 180 
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tattttgact 


ttaacagatt 


ccactgccac 


ttctcaccac 


gtgactggca 


gcgactcatc 


240 


5 


aacaacaact 


ggggattc 










258 


10 


<210> 119 
<211> 255 
<212> DNA 

<213> adeno-associated virus serotype 12 










<400> 119 
ggtaattcct 


ccggaaattg 


gcattgc gat 


tccacatggc 


tgggcgaccg 


agtcattacc 


60 


15 


accagcaccc 


ggacttgggc 


cctgcccacc 


tacaacaacc 


acctctacaa 


gcaaatctcc 


120 




agccaatcgg 


gtgccaccaa 


cgacaaccac 


tacttcggct 


acagcacccc 


ttgggggtat 


180 




tttgatttca 


acagattcca 


ctgccatttc 


tcaccacgtg 


actggcagcg 


actcatcaac 


240 


20 


aacaactggg 


gattc 










255 


25 


<210> 120 
<211> 2205 
<212> DNA 

<213> adeno-associated virus serotype, clone AS.lypl 








<400> 120 
atggctgccg 


atggttatct 


tccagattgg 


ctcgaggaca 


ctctctctga 


aggaatcaga 


60 


30 


cagtggtgga 


agctcaaacc 


tggcccacca 


ccgccgaaac 


ctaaccaaca 


acaccgggac 


120 




gacagtaggg 


gtcttgtgct 


tcctgggtac 


aagtacctcg 


gacccttcaa 


cggactcgac 


180 




aaaggagagc 


cggtcaacga 


ggcagacgcc 


gcggccctcg 


agcacgacaa 


agcctacgac 


240 


35 


caccagctca 


agcaagggga 


caacccgtac 


ctcaaataca 


accacgcgga 


cgctgaattr 


300 




caggagcgtc 


ttcaagaaga 


tacgtctttc 


gggggcaacc 


tcgggcgagc 


agtcttccag 


360 




gccaaaaaga 


gggtactcga 


gcctcttggt 


ctggttgagg 


aagctgttaa 


gacggctcc- 


42 0 


An 


ggaaaaaaga 


gacctataga 


gcagtctcct 


gcagaaccgg 


actcttcctc 


gggcatcggc 


480 




aaatcaggcc 


agcagcccgc 


taagaaaaga 


ctcaattttg 


gtcagactgg 


cgacacagag 


540 




tcagtcccag 


accctcaacc 


aatcggagaa 


ccccccgcag 


ccccctctgg 


tgtgggatct 


600 


45 


aatacaatgg 


cttcaggcgg 


tggggcacca 


atggcagaca 


ataacgaagg 


cgccgacgga 


660 




gtgggtaatt 


cctcgggaaa 


ttggcattgc 


gattccacat 


ggatgggcga 


cagagttatc 


720 




accaccagca 


caagaacctg 


ggccctcccc 


acctacaata 


atcacctcta 


caagcaaatc 


780 


50 


tccagcgaat 


cgggagccac 


caacgacaac 


cactacttcg 


gctacagcac 


cccctggggg 


840 




tattttgact 


ttaacagatt 


ccactgtcac 


ttctcaccac 


gtgactggca 


gcgactcatc 


900 




aacaacaact 


ggggatttag 


acccaagaaa 


ctcaatttca 


agctcttcaa 


catccaagtc 


960 


55 


aaggaggtca 


cgcagaatga 


tggaaccacg 


accatcgcca 


ataaccttac 


cagcacggtg 


1020 
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caggtcttca 


cagactctga 


gtaccagctg 


ccctacgtcc 


tcggttcggc 


tcaccagggc 


J-Uo U 


5 


tgccttccgc 


cgttcccagc 


agacgtcttc 


atgatt cc t c 


agtacggcta 


+^ ^ #^ ^ /V 

CL, Uya.CT,CL.g 






aacaatggca 


gccaagcggt 


aggacgttct 


teat tc tact 


gt c tagagt a 




1 0 n n 




cagatgctga 


ggacgggaaa 


caacttcacc 


+^ 4— ^ ^ a o 

^ucaycuaco. 








10 


cacagcagc t 


acgcgcacag 


ccagagtctg 


gatcgg c t ga 


^ a a 4*" +" ^ +* 

ugaaTirCCL-C'U 


^ +- /-f A ^ /-« ^ /*f 

c<aucgacca.g 






t accrgtatt 


acctgagcaa 


aactcagggt 


acaagu ggao. 










cagtt cagcc 


aagctgggcc 


tagctccatg 


gctcagcagg 


ccaaaaactg 


+^ ^ ^ y-*- -N 

uaccggga 


Xfr*± U 


15 


cccagctacc 


gacagcagcg 


aatgtctaag 


acggct aatg 


acaacaacaa 


cagt gaat tt 






gcttggactg 


cagccaccaa 


atattacctg 


aatggaagaa 


attct ctggt 


caat cccggg 


XOoU 




cccccaatgg 


ccagtcacaa 


ggacgatgag 


gaaaagtatt 


tccccatgca 


cggaaatc tc 




20 


atctttggaa 


aacaaggcac 


aggaactacc 


aatgtggaca 


uugaa tcagI, 


y C ^ I, ax* If o C a 


X DO u 




gacgaagaag 


aaatcagaac 


aactaatcct 


gtggcc acag 


aacaa uacgg 


aca.ggui<gc<^ 


X / f* VJ 




accaaccatc 


agagtcagaa 


caccacagct 


tcctat ggaa 


gt gtggacag 


ccagggaatc 


XoU u 




t tacctggaa 


tggtgtggca 


ggaccgcgat 


gt ctat ct t c 


aaggt cccat 


tt gggc caaa 


XODVJ 




actcctcaca 


cggacggaca 


ctt tcatcct 


tctccgctca 


tgggaggctt 


tggactgaaa 


1 y^i u 




caccctcctc 


cccagatcct 


gatcaaaaac 


acacctgtgc 


cagcgaatcc 


cgcgaccact 


1980 


30 


ttcactcctg 


gaaagtttgc 


ttcgttcatt 


acccagtatt 


ccaccggaca 


ggtcagcgtg 


2040 




gaaatagagt 


gggagctgca 


gaaagaaaac 


agcaaacgct 


ggaacccaga 


aattcagtac 


2100 




acctccaact 


acaacaagtc 


ggtgaatgtg 


gagttt accg 


tggacgcaaa 


cggtgtttat 


2160 


35 


tctgaacccc 


gccctattgg 


cactcgttac 


cttacccgga 


acttg 




2205 



40 Claims 



1 . A method of detecting adeno-associated virus (AAV) sequences in a sample, said metliod comprising the steps of: 

(a) subjecting a sample containing DNAto amplification via polymerase chain reaction (PGR) using a first set 
45 of primers which specifically amplify a first region of AAV nucleic acid sequences; 

(b) optionally subjecting the DNA to further amplification using a second set of primers which specifically 
amplify a second region which comprises the first region of AAV sequences and sequences which are 5' to 
the first region, such that AAV 5' extension sequences which anneal to the 5' end of the AAV sequences 
amplified by the primers for the first region are obtained; 

50 (c) optionally subjecting the DNA to further amplification using a third set of primers which specifically amplify 

a third region which comprises the first region of AAV sequences and sequences which are 3' to the first region, 
such that AAV 3' extension sequences which anneal to the 3' end of the AAV sequences amplified by the 
primers for the first region are obtained, 



55 wherein each of said regions is predetermined based upon the alignment of the nucleic acid sequences of 

at least two AAV serotypes and each of said regions comprises nucleic acid sequences which are highly conserved 
over at least 18 base pairs at the 5' end, optionally variable sequences in the middle, and sequences which are 
highly conserved over at least 18 base pairs at the 3' end of the sequences of the region, relative to the sequences 
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of the at least two aligned AAV serotypes; and 

wherein each of the sets of primers consist of a 5' prinner and a 3' prinner; 

wherein the presence of annplified sequences indicates the presence of an AAV in the sample. 

5 2. A method according to claim 1 , wherein steps (b) and (c) are performed, and further comprising the step of: 

(d) using the amplified sequences to construct a sequence comprising a partial and/or complete AAV gene, 
thereby isolating an AAV gene sequence from the sample. 

10 3. The method according to claim 2, wherein the first region is in the AAV capsid, the second region extends to the 
3' end of the rep genes, and the third region extends to 5' of the AAV 3' ITR, such that step (d) permits the con- 
struction of AAV gene sequences comprising the complete AAV capsid gene. 

4. The method according to claim 2, further comprising the step of subjecting the DNA to amplification with a fourth 
15 set of primers which specifically amplifies a fourth region which extend 5' to the third region. 

5. The method according to claim 4, wherein the fourth region extends to 3' of the AAV 5' ITR, such that step (d) 
permits the construction of AAV gene sequences comprising the complete AAV rep and cap genes. 

20 6. The method according to claim 1 or claim 2, wherein said sample comprises cellular or genomic DNA. 

7. The method according to claim 1 or claim 2, wherein said DNA has been extracted from the group consisting of 
cells, cell culture, tissue, tissue culture, and biological fluids. 

25 8. The method according to claim 1 or claim 2, wherein the first region is about 250 base pairs in length. 

9. The method according to claim 1 or 2, wherein the first region is highly conserved over at least about 25 base 
pairs at the 5' and/or 3' end of the region. 

30 10. The method according to claim 1 or 2, wherein the first region is highly conserved over at least about 30 base 
pairs at the 5' and/or 3' end of the region. 

11. The method according to claim 1 or claim 2, wherein the highly conserved sequences of the first region have at 
least 80% identity among the aligned AAV serotypes at the 5' and/or 3' end of the region. 

35 

12. The method according claim 11 , wherein the highly conserved sequences of the first region have at least 90% 
identity among the aligned AAV serotypes at the 5' and/or 3' end of the region. 

13. The method according to claim 1 or claim 2, wherein the variable sequences in the middle of the first region have 

40 less than 70% identity among the aligned AAV serotypes. 

14. The method according to claim 1 or claim 2, wherein the first region spans about bp 2800 to about 3200 of AAV 
1 , SEQ ID NO:6, and corresponding base pairs in other AAV serotypes. 

45 15. The method according to claim 14, wherein the first region is 257 bp spanning bp 2886 to about 3143 bp of AAV 
1 , SEQ ID NO:6, and corresponding base pairs in other AAV serotypes. 

16. The method according to claim 1 , wherein the primers are AVI ns and AV2cas. 

50 17. Amethodof identifying the serotype of adeno-associated virus (AAV) sequences in a sample as known or unknown, 
said method comprising the steps of: 

(a) obtaining an enzymatic digestion analysis of a sample containing DNA molecules comprising AAV se- 
quences which span all or a portion of bp 2886 to 3143 of AAV1 , SEQ ID N0:1 , and corresponding regions 

55 of other AAV serotypes; and 

(b) comparing the enzymatic digestion analysis from the sample to enzymatic digestion analysis for corre- 
sponding regions of one or more AAV serotypes, thereby identifying the AAV sequences in the sample as 
being from one of the one or more AAV serotypes or from an unknown serotype. 
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18. The method according to claim 1 7, wherein the AAV sequences were obtained by the method of claim 1 . 

19. A diagnostic kit for detecting the presence of an unknown adeno-associated virus (AAV) in a sample, said kit 
comprising: 

5 

(a) a first set of primers which specifically amplify a first region of AAV nucleic acid sequences; 

(b) optionally a second set of primers specific for a second region of the AAV nucleic acid sequences which 
comprises the first region of AAV sequences and sequences which are 5' to the first region, such that AAV 5' 
extension sequences which anneal to the 5' end of the AAV sequences amplified by the primers for the first 

10 region are obtained; 

(c) optionally a third set of primers which specifically amplify a third region which comprises the first region of 
AAV sequences and sequences which are 3' to the first region, such that AAV 3' extension sequences which 
anneal to the 3' end of the AAV sequences amplified by the primers for the first region are obtained, 

15 wherein each of said regions is predetermined based upon the alignment of the nucleic acid sequences of 

at least two AAV serotypes and each of said regions comprises nucleic acid sequences which are highly conserved 
over at least 18 base pairs at the 5' end, optionally variable sequences in the middle, and sequences which are 
highly conserved over at least 18 base pairs at the 3' end of the sequences of the region, relative to the sequences 
of the at least two aligned AAV serotypes 

20 wherein each of the sets of primers consist of a 5' primer and a 3' primer. 

20. A method for isolating novel adeno-associated viruses (AAV) from a cell, said method comprising the steps of: (a) 
infecting the cell with a virus which provides helper functions to said AAV; (b) isolating infectious clones containing 
AAV; (c) sequencing the isolated AAV; and (d) comparing the sequences of the isolated AAV to known AAV sero- 

25 types, whereby differences in the sequences of the isolated AAV and known AAV serotypes indicates the presence 

of a novel AAV. 

21. The method according to claim 20, wherein said virus is an adenovirus. 

30 22. The method according to claim 21 , wherein said adenovirus is of human or non-human primate origin. 

23. A novel adeno-associated virus (AAV) serotype identified by a method according to claim 2 or claim 20. 

24. An isolated adeno-associated virus (AAV) comprising an AAV capsid having an amino acid sequence selected 
35 from the group consisting of: 

AAV7, amino acids 1 to 737 of SEQ ID NO:2; C1, SEQ ID NO:60; C2, SEQ ID NO:61; C5, SEQ ID NO:62; 
A3-3, SEQ ID NO:66; A3-7, SEQ ID NO:67; A3-4, SEQ ID NO:68; A3-5, SEQ ID NO: 69; 3.3b, SEQ ID NO: 
62; 223.4, SEQ ID NO: 73; 223-5, SEQ ID NO:74; 223-1 0, SEQ ID NO:75; 223-2, SEQ ID NO:76; 223-7, SEQ 

40 ID NO: 77; 223-6, SEQ ID NO: 78; 44-1, SEQ ID NO: 79; 44-5, SEQ ID NO:80; 44-2, SEQ ID NO:81; 42-15, 

SEQ ID NO: 84; 42-8, SEQ ID NO: 85; 42-13, SEQ ID NO:86; 42-3A, SEQ ID NO:87; 42-4, SEQ ID NO:88; 
42-5A, SEQ ID NO:89; 42-1 B, SEQ ID NO:90; 42-5B, SEQ ID NO:91 ; 43-1 , SEQ ID NO: 92; 43-12, SEQ ID 
NO: 93; 43-5, SEQ ID NO:94; 43-21 , SEQ ID NO:96; 43-25, SEQ ID NO: 97; 43-20, SEQ ID NO:99; 24.1 , 
SEQ ID NO: 101; 42.2, SEQ ID NO:102; 7.2, SEQ ID NO: 103; 27.3, SEQ ID NO: 104; 16.3, SEQ ID NO: 105; 

45 42.10, SEQ ID NO: 106; 42-3B, SEQ ID NO: 107; 42-11, SEQ ID NO: 108; F1 , SEQ ID NO: 109; F5, SEQ ID 

NO: 110; F3, SEQ ID NO:111; 42-6B, SEQ ID NO: 112; and 42-12, SEQ ID NO: 113. 

25. A protein comprising a fragment of an AAV capsid protein, said fragment selected from the group consisting of: 

50 vp2 capsid protein, amino acids (aa) 1 38 to 737; 

vp3 capsid protein, aa 203 to 737; 

hypervariable region (HVR)1 through 12: aa 146 to 152; aa 182 to 187; aa262 to 264; aa 263 to 266; aa 263 
to 266; aa 381 to 383; aa 383 to 385; aa 450 to 474; aa 451 to 475; aa 490 to 495; aa 491 to 496; aa500 to 
504; aa 501 to 505; aa 51 4 to 522; aa 533 to 554; aa 534 to 555; aa 581 to 594; aa 583 to 596; aa 658 to 667; 
55 aa 660 to 669; and aa 705 to 71 9; aa 707 to 772; 

aa 24-42, aa 25-28; aa 81-85; aa133 to 165; aa 134 - 165; aa 137 to 143; aa 154 to 156; aa 194 to 208; aa 
261 to 274; aa 262 to 274; aa171 to 173; aa 413 to 41 7; aa 449 to 478; as 494 to 525; as 534 to 571 ; aa 581 
to 601 ; as 660 to 671 ; as 709 to 723; and 
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aa 1 to 1 84, aa 199 to 259; aa 274 to 446; aa 603 to 659; aa 670 to 706; aa 724 to 736; aa 1 85 to 1 98; aa 
260 to 273; aa 447 to 477; aa 495 to 602; aa 660 to 669; and aa 707 to 723, 

wherein the amino acid numbers are those of the AAV7 capsid, SEQ ID N0:2, and corresponding regions 
5 in the capsid of C1 , SEQ ID NO:60; C2, SEQ ID NO:61 ; C5, SEQ ID NO:62; A3-3, SEQ ID NO:66; A3-7, SEQ ID 

NO:67; A3-4, SEQ ID NO:68; A3-5, SEQ ID NO: 69; 3.3b, SEQ ID NO: 62; 223.4, SEQ ID NO: 73; 223-5, SEQ ID 
NO:74; 223-10, SEQ ID NO:75; 223-2, SEQ ID NO:76; 223-7, SEQ ID NO: 77; 223-6, SEQ ID NO: 78; 44-1, SEQ 
ID NO: 79; 44-5, SEQ ID NO:80; 44-2, SEQ ID NO:81 ; 42-15, SEQ ID NO: 84; 42-8, SEQ ID NO: 85; 42-13, SEQ 
ID NO:86; 42-3A, SEQ ID NO:87; 42-4, SEQ ID NO:88; 42-5A, SEQ ID NO:89; 42-1 B, SEQ ID NO:90; 42-5B, 
10 SEQ ID NO:91 ; 43-1 , SEQ ID NO: 92; 43-12, SEQ ID NO: 93; 43-5, SEQ ID NO:94; 43-21 , SEQ ID NO:96; 43-25, 

SEQ ID NO: 97; 43-20, SEQ ID NO:99; 24.1 , SEQ ID NO: 101; 42.2, SEQ ID NO:102; 7.2, SEQ ID NO: 103; 27.3, 
SEQ ID NO: 104; 16.3, SEQ ID NO: 105; 42.10, SEQ ID NO: 106; 42-3B, SEQ ID NO: 107; 42-11, SEQ ID NO: 
108; F1, SEQ ID NO: 109; F5, SEQ ID NO: 110; F3, SEQ ID NO:1 11 ; 42-6B, SEQ ID NO: 112; and 42-12, SEQ 
ID NO: 113. 

15 

26. An artificial adeno-associated virus (AAV) capsid protein comprising one or more of the protein fragments according 
to claim 25. 

27. A recombinant adeno-associated virus (AAV) comprising an artificial capsid according to claim 26. 

20 

28. A molecule comprising a nucleic acid sequence encoding a protein according to claim 25 to 26. 

29. A molecule comprising a nucleic acid sequence encoding a novel adeno-associated virus (AAV) serotype capsid 
protein having an amino acid sequence selected from the group consisting of: AAV7, amino acids 1 to 737 of SEQ 

25 ID NO:2; 01, SEQ ID NO:60; 02, SEQ ID NO:61; 05, SEQ ID NO:62; A3-3, SEQ ID NO:66; A3-7, SEQ ID NO: 

67; A3-4, SEQ ID NO:68; A3-5, SEQ ID NO: 69; 3.3b, SEQ ID NO: 62; 223.4, SEQ ID NO; 73; 223-5, SEQ ID NO: 
74; 223-10, SEQ ID NO:75; 223-2, SEQ ID NO:76; 223-7, SEQ ID NO: 77; 223-6, SEQ ID NO: 78; 44-1, SEQ ID 
NO: 79; 44-5, SEQ ID NO:80; 44-2, SEQ ID NO:81 ; 42-15, SEQ ID NO: 84; 42-8, SEQ ID NO: 85; 42-13, SEQ ID 
NO:86; 42-3A, SEQ ID NO:87; 42-4, SEQ ID NO:88; 42-5A, SEQ ID NO:89; 42-1 B, SEQ ID NO:90; 42-5B, SEQ 

30 ID NO:91 ; 43-1 , SEQ ID NO: 92; 43-12, SEQ ID NO: 93; 43-5, SEQ ID NO:94; 43-21 , SEQ ID NO:96; 43-25, SEQ 

ID NO: 97; 43-20, SEQ ID NO:99; 24.1 , SEQ ID NO: 101; 42.2, SEQ ID NO:102; 7.2, SEQ ID NO: 103; 27,3, SEQ 
ID NO: 104; 16.3, SEQ ID NO: 105; 42.10, SEQ ID NO: 106; 42-3B, SEQ ID NO: 107; 42-11, SEQ ID NO: 108; 
F1 , SEQ ID NO: 109; F5, SEQ ID NO: 110; F3, SEQ ID NO:111 ; 42-6B, SEQ ID NO: 112; and 42-12, SEQ ID NO: 
113. 

35 

30. A molecule comprising a nucleic acid sequence encoding a novel adeno-associated virus (AAV) serotype capsid 
protein, said nucleic acid sequence selected from the group consisting of: AAV5, SEQ ID NO:2; 42-2 , SEQ ID 
NO:9; 42-8 ,SEQ ID NO:27; 42-15 , SEQ ID NO:28; 42-5b ,SEQ ID NO: 29; 42-1b ,SEQ ID NO:30; 42-13, SEQ 
ID NO: 31; 42-3a , SEQ ID NO: 32; 42-4 ,SEQ ID NO:33; 42-5a , SEQ ID NO: 34; 42-10, SEQ ID NO:35; 42-3b, 

40 SEQ ID NO: 36; 42-11 , SEQ ID NO: 37; 42-6b, SEQ ID NO:38; 43-1 , SEQ ID NO: 39; 43-5, SEQ ID NO: 40; 43-12 , 

SEQ ID NO:41 ; 43-20, SEQ ID NO:42; 43-21 , SEQ ID NO: 43; 43-23, SEQ ID NO:44; 43-25 ,SEQ ID NO: 45; 44,1 , 
SEQ ID NO:47; 44.5, SEQ ID NO:47; 223.1 0, SEQ ID NO:48; 223.2, SEQ ID NO:49; 223.4, SEQ ID NO:50; 223,5, 
SEQ ID NO: 51 ; 223.6, SEQ ID NO: 52; 223.7, SEQ ID NO: 53; A3.4, SEQ ID NO: 54; A3.5, SEQ ID NO:55; A3. 7 , 
SEQ ID NO: 56; A3.3 ,SEQ ID NO:57; 42.12, SEQ ID NO: 58; 44.2, SEQ ID NO: 59; AAV10, SEQ ID NO: 117; 

45 AAV11, SEQ ID NO: 118; AAV12 ,SEQ ID NO:119; A3. 1, SEQ ID NO:120; and H6, SEQ ID NO: 25. 

31 . A molecule comprising a nucleic acid sequence encoding a fragment of an adeno-associated virus capsid protein, 
said nucleic acid sequence selected from the group consisting of: 

50 vpl , nt 825 to 3049; 

vp2, nt 1234 to 3049; 
vp3, nt 1434 to 3049; 
nt 468 to 3090; and 
nt 725 to 3090, 

55 

wherein the nucleotides numbers are of AAV7, SEQ ID NO:1 and correspond to sequences in 42-2 , SEQ 
ID NO:9; 42-8 ,SEQ ID NO:27; 42-15 , SEQ ID NO:28; 42-5b ,SEQ ID NO: 29; 42-1b ,SEQ ID NO:30; 42-13, SEQ 
ID NO: 31; 42-3a , SEQ ID NO: 32; 42-4 ,SEQ ID NO:33; 42-5a , SEQ ID NO: 34; 42-10, SEQ ID NO:35; 42-3b, 
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SEQ ID NO: 36; 42-11 , SEQ ID NO: 37; 42-6b, SEQ ID NO:38; 43-1 , SEQ ID NO: 39; 43-5, SEQ ID NO: 40; 43-12 , 
SEQ ID NO:41 ; 43-20, SEQ ID NO:42; 43-21 , SEQ ID NO: 43; 43-23, SEQ ID NO:44; 43-25 ,SEQ ID NO: 45; 44,1 , 
SEQ ID NO:47; 44.5, SEQ ID NO:47; 223.1 0, SEQ ID NO:48; 223.2 , SEQ ID NO:49; 223.4, SEQ ID NO:50; 223.5, 
SEQ ID NO: 51 ; 223.6, SEQ ID NO: 52; 223.7, SEQ ID NO: 53; A3.4, SEQ ID NO: 54; A3.5, SEQ ID NO:55; A3.7 , 
5 SEQ ID NO: 56; A3. 3 ,SEQ ID NO:57; 42.12, SEQ ID NO: 58; 44.2, SEQ ID NO: 59; AAV10, SEQ ID NO: 117; 

AAV11, SEQ ID NO: 118; AAV12 ,SEQ ID NO:119; A3. 1, SEQ ID NO:120; and H6, SEQ ID NO: 25. 

32. The molecule according to claim 26 or claim 28 to 31 , wlierein said molecule is a plasmid. 

10 33. The molecule according to claim 26 or claim 28 to 31 , wherein said molecule further comprises a functional AAV 
rep gene. 

34. The molecule according to claim 30, wherein said nucleic acid sequence is the AAV7 sequence, SEQ ID NO:1 . 

f5 35. A method of generating a recombinant adeno-associated virus (AAV) comprising an AAV serotype capsid com- 
prising the steps of culturing a host cell containing: (a) a molecule according to any of claims 26 or claim 28 to 31 
which encodes an adeno-associated virus capsid; (b) afunctional rep gene; (c) a minigene comprising AAV inverted 
terminal repeats (ITRs) and a transgene; and (d) sufficient helper functions to permit pacl<aging of the minigene 
into the AAV capsid protein. 

20 

36. A host cell transfected with an adeno-associated virus according to claim 24 or a molecule according to any of 
claims 26, or claims 28 to 31 . 

37. A composition comprising an AAV according to claim 24 or claim 27, and a physiologically compatible carrier 

25 

38. A composition comprising a molecule according to any of claims 26, or claims 28 to 31 and a physiologically 

compatible carrier. 

39. A method of delivering a transgene to a cell, said method comprising the step of contacting the cell with an AAV 
30 according to claim 24 or claim 27, wherein said rAAV comprises the transgene. 

40. A molecule comprising a heterologous adeno-associated virus (AAV) serotype 7 nucleic acid sequence, said se- 
quence comprising: 

35 nucleotides (nt) 1 to 107 of SEQ ID NO: 1; 

nt 107to 2215 of SEQ ID NO:1; 

nt 334 to 2215 of SEQ ID NO:1; 

nt 2222 to 4435 of SEQ ID NO: 1 . 

nt 2633 to 4435 of SEQ ID NO:1 ; 
40 nt 2831 to 4435 of SEQ ID NO:1 ; and 

nt 4704 to 4721 of SEQ ID NO: 1 . 

41. A molecule encoding an adeno-associated virus (AAV) serotype 7 rep protein or a fragment thereof, said protein 
or fragment selected from the group consisting of: 



45 



50 



amino acid (aa) 1 to 623, aa 1 to 1 71 ; aa 172 to 372, aa 373 to 444, and aa 445 to 623 of SEQ ID NO:3. 
42. A host cell containing a molecule according to claim 40 or 41 . 
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GATCGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 



AGAAATGCAA 
AGAAATGCAA 
AGAAATGCAG 
AGAAATGCAA 
AAAAGTGCAA 
AAAAGTGCAA 
AGAAATGCAA 
AAAAGTGCAA 
AAAAGTGCAA 
AAAAGTGCAA 
AAAAGTGCAA 
AAAAGTGCAA 



GTCTTCCGCC 
GTCTTCCGCC 
GTCTTCCGCC 
GTCTTCCGCC 
GTCGTCCGCC 
GTCGTCCGCC 
GTCGTCCGCC 
GTCATCGGCC 
GTCGTCCGCC 
GTCGTCCGCC 
GTCGTCCGCC 
GTCGTCCGCC 



CAGATCGACC 
CAGATCGACC 
CAGATCGACC 
CAGATCGACC 
CAGATCGACC 
CAGATCGACC 
CAGATAGACC 
CAGATCGAAC 
CAGATCGACC 
CAGATCGACC 
CAGATCGACC 
CAGATCGACC 



CGACTCCGGT 
CGACTCCGGT 
CGACTCCGGT 
CGACTCCGGT 
CCACCCCCGT 
CCACCCCCGT 
CGACTCCGGT 
CCACTCCCGT 
CCACCCCCGT 
CCACTCCCGT 
CCACCCCCGT 
CCACCCCCGT 



GATTGTCACC 
GATTGTCACC 
GATTGTCACC 
GATTGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 



346 



EP1 310 571 A2 



4 2_2 
4 2_8 
42_15 
42_5b 
42_lb 
42__13 
42_3a 
42__4 
42__5a 
42_10 
42__3b 
42_11 
42_6b 
4 3_1 
43_5 
43_12 
4 3_20 
43__21 
43_^23 
4 3_2 5 
44_1 
4 4__5 
223_^10 
223_2 
223_4 
223^5 
223__6 
223_7 
A3_4 
A3_5 
A3_7 
A3_3 
42_12 
AAVl 
AAV2 
AAV 3 
AAV8 
AAV9 
AAV7 
44 2 



Fig . lAG 



1601 

TCCAACACCA 
TCCAACACCA 
TCCAACACCA 
TCCAACACCA 


ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 


TGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 


GGGAACAGCA 

GGGAACAGCA 
GGGTVACAGCA 
GGGAACAGCA 


1650 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 


TCCAACACCA 
TCCAACACCA 


ACATGTGCGC 
ACATGTGCGC 


CGTGATTGAC 
CGTGATTGAC 


GGGAACAGCA 
GGGAACAGCA 


CCACCTTCGA 
CCACCTTCGA 


TCCAACACCA 


ACATGTGCGC 


CGTGATTGAC 


GGGAACAGCA 


CCACCTTCGA 





TCCAACACCA 
GCGCAGGATC 
TCCAACACCA 
TCCAACACCA 
TCCAACACCA 
TCCAACACCA 
TCCAACACCA 
TCCAACACCA 
TCCAACACCA 
TCCAACACCA 
TCCAACACCA 



ACATGTGCGC 
ACGTGACCGA 
ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 



CGTGATTGAC 
GGTGGCGCAT 
CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 



GGGAACAGCA 
GAGTTCTACG 
GGGAACAGCA 
GGGAACAGCA 
GGGAACAGCA 
GGGAACAGCG 
GGGAACAGCA 
GGGAACAGCA 
GGGAACAGCA 
GGGAACAGCA 
GGGAACAGCA 



CCACCTTCGA 
TCAGAAAGGG 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 



TCTAACACCA 
TCTTVACACCA 
TCTAACACCA 
TCTAACACCA 
TCCAACACCA 
TCCAACACCA 
TCCAACACCA 
TCCAACACCA 
TCCAACACCA 
TCCAACACCA 
TCCAACACCA 
TCCAACACCA 



ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 
ACATGTGCGC 



CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 
CGTGATTGAC 



GGAAACTCGA 
GGAAACTCGA 
GGAAACTCGA 
GGAAACTCGA 
GGGAACAGCA 
GGGAACAGCA 
GGGAACTCAA 
GGGAACAGCA 
GGGAACAGCA 
GGGAACAGCA 
GGGAACAGCA 
GGGAACAGCA 



CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 
CCACCTTCGA 
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Fig. lAH 



1651 

GCACCAGCAG 
GCACCAGCAG 
GCACCAGCAG 
GCACCAGCAG 


CCGTTACAAG 
CCGTTACAAG 

CCGTTGCAGG 
CCGTTACAAG 


ACCGGATGTT 
ACCGGATGTT 

ACCGGATGTT 
ACCGGATGTT 


CAAATTTGAA 
CAAATTTGAA 

CAAATTTGAA 
CAAATTTGAA 


1700 
CTCACCCGCC 
CTCACCCGCC 
CTCACCCGCC 
CTCACCCGCC 


GCACCAGCAG 
GCACCAGCAG 


CCGTTACAAG 
CCGTTACAAG 


ACCGGATGTT 
ACCGGATGTT 


CAAATTTGAA 
CAAATTTGAA 


CTCACCCGCC 
CTCACCCGCC 


GCACCAGCAG 


CCGTTGCAGG 


ACCGGATGTT 


CAAATTTGAA 


CTCACCCGCC 





GCACCAGCAG CCGTTACAAG ACCGGATGTT CAAATTTGAA CTCACCCGCC 
TGGAGCCAAC AAGAGACCCG CCCCCGATGA CGCGGATAAA AGCGAGCCCA 
GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAGTTCGAA CTCACCCGCC 
GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAGTTCGAA CTCACCCGCC 
GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAGTTCGAA CTCACCCGCC 
GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAATTTGAA CTCACCCGCC 
GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAATTTGAA CTCACCCGCC 
GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAATTTGAA CTCACCCGCC 
GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAATTTGAA CTCACCCGCC 
GCACCAGCAG CCGTTGCGGG ACCGGATGTT CAAGTTTGAA CTCACCCGCC 
GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAGTTTGAA CTCACCCGCC 



GCACCAGCAG 

GCACCAGCAG 
GCACCAGCAG 
GCACCAGCAG 
GCACCAGCAG 
GCACCAGCAG 
ACACCAGCAG 
GCATCAGCAG 
GCACCAGCAG 
GCACCAGCAG 
GCACCAGCAG 
GCACCAGCAG 



CCGTTGCAAG 

CCGTTGCAAG 
CCGTTGCAAG 
CCGTTGCAAG 
CCGTTACAAG 
CCGTTGCAGG 
CCGTTGCAAG 
CCGCTGCAGG 
CCTCTCCAGG 
CCTCTCCAGG 
CCGTTGCAGG 
CCGTTGCAGG 



ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 



C?vAATTTGAA 
CAAATTTGAA 
CAAATTTGAA 
CAAATTTGAA 
CAAATTTGAA 
CAAATTTGAA 
CAAATTTGAA 
TGAATTTGAA 
TAAGTTCGAA 
TAAGTTCGAA 
CAAATTTGAA 
CAAGTTTGAA 



CTTACCCGCC 
CTTACCCGCC 
CTTACCCGCC 
CTTACCCGCC 
CTCACCCGCC 
CTCACCCGCC 
CTCACCCGCC 
CTTACCCGCC 
CTCACCCGiCC 
CTCACCCGCC 
CTCACCCGCC 
CTCACCCGCC 
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Fig. lAI 



1701 

GTCTGGAGCA 
GTCTGGAGCA 

GTCTGGAGCA 
GTCTGGAGCA 


CGACTTTGGC 
CGACTTTGGC 
TGACTTTGGC 
CGACTTTGGC 


7VAGGTGACAA 
AAGGTGACAA 
AAGGTGACAA 
AAGGTGACAA 


AGCAGGAAGT 
AGCAGGAAGT 
AGCAGGAAGT 
AGCAGGAAGT 


1750 
CAAAGAGTTC 

CAAAGAGTTC 
C7\AAGAGTTC 
CAAAGAGTTC 


GTCTGGAGCA 
GTCTGGAGCA 


TGACTTTGGC 
TGACTTTGGC 


AAGGTGACAA 
AAGGTGACAA 


AGCAGGAAGT 
AGCAGGAAGT 


CAAAGAGTTC 
CAAAGAGTTC 


GTCTGGAGCA 


TGACTTTGGC 


AAGGCGACAA 


AGCAGGAAGT 


CAAAGAGTTC 





GTCTGGAGCA 
AGCGGGCCTG 
GTCTGGAGCA 
GTCTGGAGCA 
GTCTGGAGCA 
GTCTGGAGCA 
GTCTGGAGCA 
GTCTGGAGCA 
GTCTGGAGCA 
GTCTGGAGCA 
GTCTGGAGCA 



CGACTTTGGC 
CCCCTCAGTC 
CGACTTTGGC 
CGACTTTGGC 
CGACTTTGGC 
TGACTTTGGC 
TGACTTTGGC 
TGACTTTGGC 
TGACTTTGGC 
CGACTTTGGC 
CGACTTTGGC 



AAGGTGACAA 
GCGGATCCAT 
AAGGTGACCA 
AAGGTGACCA 
AAGGTGACCA 
AAGGTGACCA 
AAGGTGACCA 
AAGGTGACCA 
AAGGTGACGA 
AAGGTGACAA 
AAGGTGACAA 



AGCAGGAAGT 
CGACGTCAGA 
AGCAGGAAGT 
AGCAGGAAGT 
AGCAGGAAGT 
AGCAGGAAGT 
AGCAGGAAGT 
AGCAGGAAGT 
AGCAGGAAGT 
AGCAGGAAGT 
AGCAGGAAGT 



CAAAGAGTTC 
CGCGGAAGGA 
CAAAGAGTTC 
CAAAGAGTTC 
CAAAGAGTTC 
CAAAGAGTTC 
CAAAGAGTTC 
CAAAGAGTTC 
CAAAGGGTTC 
CAGAGAGTTC 
CAGAGAGTTC 



GTTTGGATCA 
GTTTGGATCA 
GTTTGGATCA 
GTTTGGATCA 
GTCTGGAGCA 
GTCTGGAGCA 
GTCTGGATCA 
GTTTGGACCA 
GTCTGGAGCA 
GTCTGGAGCA 
GTCTGGAGCA 
GTCTGGAGCA 



TGACTTTGGC 
TGACTTTGGG 
TGACTTTGGC 
TGACTTTGGG 
CGACTTTGGC 
TGACTTTGGC 
TGACTTTGGG 
TGACTTTGGG 
CGACTTTGGC 
CGACTTTGGC 
CGACTTTGGC 
CGACTTTGGC 



AAGGTGACCA 
AAGGTCACCA 
AAGGTCACCA 
AAGGTCACCA 
AAGGTGACAA 
AAGGTGACAA 
AAGGTCACCA 
AAGGTCACCA 
AAGGTGACAA 
AAGGTGACAA 
AAGGTGACGA 
AAGGTGACAA 



AGCAGGAAGT 
AGCAGGAAGT 
AGCAGGAAGT 
AGCAGGAAGT 
AGCAGGAAGT 
AGCAGGAAGT 
AGCAGGAAGT 
AACAGGAAGT 
AGCAGGAAGT 
AGCAGGAAGT 
AGCAGGAAGT 
AGCAGGAAGT 



CAAAGACTTT 
CAAAGACTTT 
CAAAGACTTT 
CAAAGACTTT 
CAAAGAGTTC 
CAAAGAGTTC 
CAAAGACTTT 
AAAGGACTTT 
CAAAGAGTTC 
CAAAGAGTTC 
CAAAGAGTTC 
CAGAGAGTTC 
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Fig. lAJ 



1751 

TTCCGCTGGG 
TTCCGCTGGG 

TTCCGCTGGG 
TTCCGCTGGG 


CGCAGGATCA 
CGCAGGATCA 
CGCAGGATCA 
CGCAGGATCA 


CGTGACCGAG 
CGTGACCGAG 
CGTGACCGAG 
CGTGACCGAG 


GTGGCGCATG 
GTGGCGCATG 
GTGGCGCATG 
GTGGCGCATG 


1800 
AGTTCTACGT 
AGTTCTACGT 
AGTTCTACGT 
AGTTCTACGT 


TTCCGCTGGG 
TTCCGCTGGG 


CGCAGGATCA 
CGCAGGATCA 


CGTGACCGAG 
CGTGACCGAG 


GTGGCGCATG 
GTGGCGCATG 


AGTTCTACGT 
AGTTCTACGT 


TTCCGCTGGG 


CGCAGGATCA 


CGTGACCGAG 


GTGGCGCATG 


AGTTCTACGT 





TTCCGCTGGG 
GCTCCGGTGG 
TTCCGCTGGG 
TTCCGCTGGG 
TTCCGCTGGG 
TTCCGCTGGG 
TTCCGCTGGG 
TTCCGCTGGG 
TTCCGCTGGG 
TTCCGCTGGG 
TTCCGCTGGG 



CGCAGGATCA 
ACTTTGCCGA 
CGCAGGATCA 
CGCAGGATCA 
CGCAGGATCA 
CGCAGGATCA 
CGCAGGATCA 
CGCAGGATCA 
CGCAGGATCA 
CGCAGGATCA 
CGCAGGATCA 



CGTGACCGAG 
CAGGTACCAA 
CGTGACCGAG 
CGTGACCGAG 
CGTGACCGAG 
CGTGACCGAG 
CGTGACCGAG 
CGTGACCGAG 
CGTGACCGAG 
CGTGACCGAG 
CGTGACCGAG 



GTGGCGCATG 
AACAAATGTT 
GTGGCGCATG 
GTGGCGCATG 
GTGGCGCATG 
GTGGCGCATG 
GTGGCGCATG 
GTGGCGCATG 
GTGGCGCATG 
GTGGCGCACG 
GTGGCGCACG 



AGTTCTACGT 
CTCGTCACGC 
AGTTCTACGT 
AGTTCTACGT 
AGTTCTACGT 
AGTTCCACGT 
AGTTCCACGT 
AGTTCCACGT 
AGTTCCACGT 
AGTTCTACGT 
AGTTCTACGT 



TTCCGCTGGG 
TTCCGCTGGG 
TTCCGCTGGG 
TTCCGGTGGG 
TTCCGCTGGG 
TTCCGCTGGG 
TTCCGGTGGG 
TTCCGGTGGG 
TTCCGCTGGG 
TTCCGCTGGG 
TTCCGCTGGG 
TTCCGCTGGG 



CTCAAGATCA 
CTCAAGATCA 
CTCAAGATCA 
CTCAAGATCA 
CGCAGGATCA 
CGCAGGATCA 
CAAAGGATCA 
CTTCCGATCA 
CCAGTGATCA 
CCAGTGATCA 
CCAGTGATCA 
CGCAGGATCA 



CGTGACTGAG 
CGTGACTGAG 
CGTGACTGAG 
CGTGACTGAG 
CGTGACCGAG 
CGTGACCGAG 
CGTGGTTGAG 
CGTGACTGAG 
CGTGACCGAG 
CGTGACCGAG 
CGTGACCGAG 
CGTGACCGAG 



GTGGAGCATG 
GTGGAGCATG 
GTGGAGCATG 
GTGGAGCATG 
GTGGCGCATG 
GTGGCGCATG 
GTGGAGCATG 
GTGGCTCATG 
GTGGCGCATG 
GTGGCGCATG 
GTGGCGCATG 
GTGGCGCACG 



AGTTCTACGT 
AGTTCTACGT 
AGTTCTACGT 
AGTTCTACGT 
AGTTCTACGT 
AGTTCTACGT 
AATTCTACGT 
AGTTCTACGT 
AGTTTTACGT 
AGTTTTACGT 
AGTTCTACGT 
AGTTCTACGT 
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Fig. lAK 



1801 



1850 
P40/TAT^ 



CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGAIA/y^A 

CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGAtAAAA 

CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATAAAA 

CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATAAAA 



CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATAAAA 
CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATAAAA 



CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATAAAA 



CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATAAAA 

GGGCATAGCG CTGACGTAAA TCACGTCATA GGGGAGTGGT CCTGTATTAG 
CAGAAAGGGC GGAGCCAGCA AAAGACCCGC CCCCGATGAC GCGGATATAA 
CAGAAAGGGC GGAGCCAGCA AAAGACCCGC CCCCGATGAC GCGGATATAA 
CAGAAAGGGC GGAGCCAGCA AAAGACCCGC CCCCGATGAC GCGGATATAA 
CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATATAA 
CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATATAA 
CAGAAAGGGT GGCGCCAACA AGAGACCCGC CCCCGATGAC GCGGATATAA 
CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATATAA 
CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATAAAA 
CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATAAAA 



CAAAAAGGGT 
CAAAAAGGGT 
CAAAAAGGGT 
CAAAAAGGGT 
CAGAAAGGGT 
CAGAAAGGGT 
CAAAAAGGGT 
CAGAAAGGGT 
CAGAAAGGGC 
CAGAAAGGGC 
CAGAAAGGGC 
CAGAAAGGGT 



GGAGCCAACA 
GGAGCCAACA 
GGAGCCAACA 
GGAGCCAACA 
GGAGCCAACA 
GGAGCCAACA 
GGAGCCAACA 
GGAGCTAAGA 
GGAGCCAGCA 
GGAGCCAGCA 
GGAGCCAGCA 
GGAGCCAACA 



AAAGGCCCGC 
TyVAGGCCCGC 
AAAGGCCCGC 
AAAGGCCCGC 
AGAGACCCGC 
AAAGACCCGC 
AAAGACCCGC 
AA.CGCCCCGC 
AAAGACCCGC 
AAAGACCCGC 
AAAGACCCGC 
AGAGACCCGC 



CCCCGATGAT 
CCCCGATGAT 
CCCCGATGAT 
CCCCGATGAT 
CCCCGATGAC 
CCCCGATGAC 
CCCCAGTGAC 
CTCCAATGAC 
CCCCGATGAC 
CCCCGATGAC 
CCCCGATGAC 
CCCCGATGAC 



gtatatataa 
gtatatataa 
gtatatataa 
gtatatataa 
gcggataaaa 
gcggataaaa 
gcagatataa 
gcggatgtaa 
gcggataaaa 
gcggataaaa 
gcggaJtataa 

GCGGAT 



TAAAA 
P4 0/TATA 
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Fig . lAL 



1851 

GCGAGCCCAA 
GCGAGCCCAA 
GCGAGCCCAA 
GCGAGCCCAA 



GCGGGCCTGC 
GCGGGCCTGC 
GCGGGCCTGC 
GCGGGCCTGC 



CCCTCAGTCG 
CCCTCAGTCG 
CCCTCAGTCG 
CCCTCAGTCG 



P40 RNA 

cggatI^catc 
cggatccatc 
cggatccatc 
cggatccatc 



1900 

GACGTCAGAC 
GACGTCAGAC 
GACGTCAGAC 
GACGTCAGAC 



GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 
GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 



GCGAGCCCAA GCGGGCCCGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 



GCGAGCCCAA 
CTGTCACGTG 
GCGAGCCCAA 
GCGAGCCCAA 
GCGAGCCCAA 
GCGAGCCCAA 
GCGAGCCCAA 
GCGAGCCCAA 
GCGAGCCCAA 
GCGAGCCCAA 
GCGAGCCCAA 



GCGGGCCTGC 

AGTGCTTTTG 
GCGGGCCTGC 
GCGGGCCTGC 
GCGGGCCTGC 
GCGGGCCTGC 
GCGGGCCTGC 
GCGGGCCTGC 
GCGGGCCTGC 
GCGGGCCTGC 
GCGGGCCTGC 



CCCTCAGTCG 

CGACATTTTG 
CCCTCAGTCG 
CCCTCAGTCG 
CCCTCAGTCG 
CCCTCAGTCG 
CCCTCAGTCG 
CCCTCAGTCG 
CCCTCAGTCG 
CCCTCAGTCG 
CCCTCAGTCG 



CGGATCCATC 

C. .ATCCATC 
CGGATCCATC 
CGGATCCATC 
CGGATCCATC 
CGGATCCATC 
CGGATCCATC 
CGGATCCATC 
CGGATCCATC 
CGGATCCATC 
CGGATCCATC 



GACGTCAGAC 

GACGTCAGAC 
GACGTCAGAC 
GACGTCAGAC 
GACGTCAGAC 
GACGTCAGAC 
GACGTCAGAC 
GACGTCAGAC 
GACGTCAGAC 
GACGTCAGAC 
GACGTCAGAC 



ATGAGCCCAA 

ATGAGCCCAA 
ATGAGCCCAA 
ATGAGCCCAA 
GCGAGCCCAA 
GCGAGCCCAA 
GTGAGCCCAA 
GCGAGCC7VAA 
GCGAGCCCAA 
GCGAGCCCAA 
GCGAGCCCAA 
GCGAGCCCAA 



GCGGGCGCGC 
GCGGGCGCGC 
GCGGGCGCGC 
GCGGGCGCGC 
GCGGGCCTGC 
GCGGGCCTGC 
ACGGGTGCGC 
ACGGGAGTGC 
GCGGGCCTGC 
GCGGGCCTGC 
GCGGGCCTGC 
GCGGGCCTGC 



GAGTCAGTTG 
GAGTCAGTTG 
GAGTCAGTTG 
GAGTCAGTTG 
CCCTCAGTCG 
CCCTCAGTCG 
GAGTCAGTTG 
ACGTCACTTG 
CCCTCAGTCG 
CCCTCAGTCG 
CCCTCAGTCG 
CCCTCAGTCG 



CGCAGCCATC 

CGCAGCCATC 
CGCAGCCATC 
CGCAGCCATC 
CGGATCCATC 
CGGATCCATC 
CGCAGCCATC 
CGCAGCCGAC 
CGGATCCATC 
CGGATCCATC 
CGGATCCATC 
CGGATjj^CATC 

P4 0 RNA 



GACGTCAGAC 

GACGTCAGAC 
GACGTCAGAC 
GACGTCAGAC 
GACGTCAGAC 
GACGTCAGAC 
GACGTCAGAC 
AACGTCAGAC 
GACGTCAGAC 
GACGTCAGAC 
GACGTCAGAC 
GACGTCAGAC 
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Fig. 1AM 



1901 

GCGGAAGGAG 
GCGGAAGGAG 
GCGGAAGGAG 
GCGGAAGGAG 



CTCCGGTGGA 
CTCCGGTGGA 
CTCCGGTGGA 
CTCCGGTGGA 



CTTTGCCGAC 
CTTTGCCGAC 
CTTTGCCGAC 
CTTTGCCGAC 



AGGTACCAAA 
AGGTACC7VAA 
AGGTACCAAA 
AGGTACCAAA 



1950 
ACAAATGTTC 
ACAAATGTTC 
ACAAATGTTC 
ACAAATGTTC 



GCGGAAGGAG 


CTCCGGTGGA 


CTTTGCCGAC 


AGGTACCAAA 


ACAAATGTTC 


GCGGAAGGAG 


CTCCGGTGGA 


CTTTGCCGAC 


AGGTACCAAA 


ACAAATGTTC 


GCGGAAGGAG 


CTCCGGTGGA 


CTTTGCCGAC 


AGGTACCAAA 


ACAAATGTTC 



GCGGAAGGAG 
GCGGAAGGAG 
GCGGAAGGAG 
GCGGAAGGAG 
GCGGAAGGAG 
GCGGAAGGAG 
GCGGAAGGAG 
GCGGAAGGAG 
GCGGAAGGAG 
GCGGAAGGAG 
GCGGAAGGAG 



CTCCGGTGGA 
CTCCGGTGGA 
CTCCGGTGGA 
CTCCGGTGGA 
CTCCGGTGGA 
CTCCGGTGGA 
CTCCGGTGGA 
CTCCGGTGGA 
CTCCGGTGGA 
CTCCGGTGGA 
CTCCGGTGGA 



CTTTGCCGAC 
CTTTGCCGAC 
CTTTGCCGAC 
CTTTGCCGAC 
CTTTGCCGAC 
CTTTGCCGAC 
CTTTGCCGAC 
CTTTGCCGAC 
CTTTGCCGAC 
CTTTGCCGAC 
CTTTGCCGAC 



AGGTACCAAA 
AGGTACCAAA 
AGGTACCAAA 
AGGTACCAAA 
AGGTACCAAA 
AGGTACCAAA 
AGGTACCAAA 
AGGTACCAAA 
AGGTACCAAA 
AGGTACCAAA 
AGGTACCAAA 



ACAAATGTTC 
ACAAGTGTTC 
ACAAATGTTC 
ACAAATGTTC 
ACAAATGTTC 
ACAAATGTTC 
ACAAATGTTC 
ACAAATGTTC 
ACAAATGTTC 
ACAAATGTTC 
ACAAATGTTC 



GCGGA. . .AG 
GCGGA. . .AG 
GCGGA. . .AG 
GCGGA. . .AG 
GCGGAAGGAG 
GCGGT^GGAG 
GCGGA. . .AG 
GCGGA ... AG 
GCGGAAGGAG 
GCGGAAGGAG 
GCGGAAGGAG 
GCGGAAGGAG 



CTTCGATAAA 
CTTCGATAAA 

CTTCGATAAA 
CTTCGATAAA 
CTCCGGTGGA 
CTCCGGTGGA 
CTTCGATCAA 
CACCGGCGGA 
CTCCGGTGGA 
CTCCGGTGGA 
CTCCGGTGGA 
CTCCGGTGGA 



CTACGCGGGC 
CTACGCGGAC 

CTACGCGGAC 
CTACGCGGAC 
CTTTGCCGAC 
CTTTGCCGAC 
CTACGCAGAC 
CTACGCGGAC 
CTTTGCCGAC 
CTTTGCCGAC 
CTTTGCCGAC 
CTTTGCCGAC 



AGGTACCAAA 
AGGTACCAAA 

AGGTACCAAA 
AGGTACCAAA 
AGGTACCAAA 
AGGTACCAAA 
AGGTACCAAA 
AGGTACCAAA 
AGGTACCZ^JVA 
AGGTACCAAA 
AGGTACCAAA 
AGGTACCAAA 



ACAAATGTTC 
ACAAATGTTC 

ACAAATGTTC 
ACAAATGTTC 
ACAAATGTTC 
ACAAATGTTC 
ACAAATGTTC 
ACAAATGTTC 
ACA7\ATGTTC 
ACAAATGTTC 
ACAA.z\TGTTC 
ACAAATGTTC 
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Fig. IAN 



1951 2000 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 

GAATTC GCCCTT GGCTGCGTC 

TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGCT TCCCTG.CAA GACATGCGAG 

GAATTC GCCCTTTCTA CGGCTGCGTC 

TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA AACATGCGAG 

GAATTC GCCCTTTCTA CGGCTGCGTC 

GAATTC GCCCTTTCTA CGGCTGCGTC 

TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA 7^ACGTGCGAG 
TCGTCACGCG GGCATGCTTC AGACGCTGTT TCCCTG.CAA AACGTGCGAG 
TCGTCACGCG GGCATGCTCC AGATGCTGTT TCCCTG.CAA AACGTGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA AACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA AACATGCGAG 



TCGTCACGTG 
TCGTCACGTG 
TCGTCACGTG 
TCGTCACGTG 
TCGTCACGCG 
TCGTCACGCG 
TCGTCACGTG 
TCGTCACGTG 
TCGTCACGCG 
TCGTCACGCG 
TCGTCACGCG 
TCGTCACGCG 



GGCATGAATC 
GGCATGAATC 
GGCATGAATC 
GGCATGAATC 
GGCATGCTTC 
GGCATGCTTC 
GGCATGAATC 
GGCATGAATC 
GGCATGCTTC 
GGCATGCTTC 
GGCATGATTC 
GGCATGCTTC 



TGATGCTGTT 
TGATGCTGTT 
TGATGCTGTT 
TGATGCTGTT 
AGATGCTGTT 
AGATGCTGTT 
TGATGCTGTT 
TGATGCTTTT 
AGATGCTGTT 
AGATGCTGCT 
AGATGCTGTT 
AGATGCTGTT 



TCCCTG.TCG 
TCCCTG.TCG 
TCCCTG.TCG 
TCCCTG.TCG 
TCCCTG.CAA 
TCCCTG.CAA 
TCCCTG.CAG 
TCCCTG.TAA 
TCCCTG.CAA 
TCCCTG.CAA 
TCCCTG.CAA 
TCCCTG.CAA 



ACAATGCGAA 
ACAATGCGAA 
ACAATGCGAA 
ACAATGCGAA 
GACATGCGAG 
GACATGCGAG 
ACAATGCGAG 
AACATGCGAG 
AACGTGCGAG 
AACGTGCGAG 
AACGTGCGAG 
AACATGCGAG 
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Fig. lAO 



2001 

AGAATGAATC 
AGAATGAATC 
AGAATGAATC 
AGAATGAATC 
A.ACTGGACC 
AGAATGAATC 
AGAATGAATC 
A.ACTGGACC 
AGAATGAATC 
A.ACTGGACC 
A . ACTAGACC 
AGAATGAATC 
AGAATGAATC 
AAAATGAATC 
AGAATGAATC 
AGAATGAATC 
AGAATGAATC 
AGAATGAATC 
AGAATGAATC 
AGAATGAATC 
AGAATGAATC 
AGAATGAATC 



AGAATTTCAA 
AGAATTTCAA 
AGAATTTCAA 
AGAATTTCAA 
A. .ATGAGAA 
AGAATTTCAA 
AGAATTTCAG 
A. .ATGAGAA 
AGAATTTCAA 
A. .ATGAGAA 
A. .ATGAGAA 
AGAATTTCAA 
AGAATTTCAA 
AGAATTTCAA 
AGAATTTCAA 
AGAATTTCAA 
AGAATTTCAA 
AGAATTTCAA 
AGAATTTCAA 
AGAATTTCAA 
AGAATTTCAA 
AGAATTTCAA 



CATTTGCTTC 
CATTTGCTTC 
CATTTGCTTC 
CATTTGCTTC 
CTTTCCCTTC 
CATTTGCTTC 
CATTTGCTTC 
CTTTCCCTTC 
CATTTGCTTC 
CTTTCCCTTC 
CTTTCCCTTC 
CATTTGCTTC 
CATTTGCTTC 
CATTTGCTTC 
CATTTGCTTC 
CATTTGCTTC 
CATTTGCTTC 
CATTTGCTTC 
CATTTGCTTC 
CATTTGCTTC 
CATTTGCTTC 
CATTTGCTTC 



ACGCACGGGA 
ACGCACGGGA 
ACGCGCGGGA 
ACGCACGGGA 

A A 

ACGCACGGGA 
ACGCACGGGA 

A A 

ACGCACGGGA 

A A 

A A 

ACGCACGGGA 
ACGCACGGGA 
ACGCACGGGG 
ACGCACGGGG 
ACGCACGGGG 
ACGCACGGGA 
ACGCACGGGA 
ACGCACGGGA 
ACGCACGGGA 
ACGCACGGGA 
ACGCACGGGA 



2050 
CCAGAGACTG 
CCAGAGACTG 
CCAGAGACTG 
CCAGAGACTG 
CGATTGCGTC 
CCAGAGACTG 
CCAGAGACTG 
CGATTGCGTC 
CCAGAGACTG 
CGATTGCGTC 
CGATTGCGTC 
CCGGAGACTG 
CCAGAGACTG 
TCAGAGACTG 
TCAGAGACTG 
TCAGAGACTG 
CCAGAGACTG 
CCAGAGACTG 
CCAGAGACTG 
CCAGAGACTG 
CCAGAGACTG 
CCAGAGACTG 



AGAATGAATC 
AGAATGAATC 
AGAATGAATC 
AGAATGAATC 
AGAATGAATC 
AGAATGAATC 
AGAATGAATC 
AGAATGAATC 
AGAATGAATC 
AGAATGAATC 
AGAATGAATC 
AGAATGAATC 



AGAATTCAAA 

AGAATTCAAA 
AGAATTCAAA 
AGAATTCAAA 
AGAATTTCAA 
AGAATTTCAA 
AGAATTCAAA 
AAATTTCCAA 
AGAATTTCAA 
AGAATTTCAA 
AGAATTTCAA 
AGAATTTCAA 



TATCTGCTTC 

TATCTGCTTC 
TATCTGCTTC 
TATCTGCTTC 
CATTTGCTTC 
CATTTGCTTC 
TATCTGCTTC 
TGTCTGTTTT 
CATTTGCTTC 
CATTTGCTTC 
CATTTGCTTC 
CATTTGCTTC 



ACACACGGGC 

ACACACGGGC 
ACACACGGGC 
ACACACGGGC 
ACGCACGGGA 
ACGCACGGGA 
ACTCACGGAC 
ACGCATGGTC 
ACACACGGGC 
ACACACGGGC 
ACACACGGGC 
ACGCACGGGA 



AAAAAGACTG 
AAAAAGACTG 
AAAAAGACTG 
AAAAAGACTG 
CCAGAGACTG 
CCAGAGACTG 
AGAAAGACTG 
AAAGAGACTG 
TCAGAGACTG 
TCAGAGACTG 
TCAGAGACTG 
CCAGAGACTG 
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Fig . lAP 





2051 








9^ on 


42_2 


TTCAGAATGT 


TTCCCCGGLCj 




TP a app 

i k^Ari^^ .... 


PP'f'PPTP A 


42_8 


TTCAGAATGT 


TTCCCCGGCG 


TGI CACjAAib 


i UA/iCb .... 




42_15 


TTCAGAATGT 


TTCCCGGGCG 




rpp* 7\ nop 

1 v^AACL. .... 




42 5b 


TTCAGAATGT 






TPa APP 

1 V^rtriiV^v^ .... 


PPTPPTPA 


42 lb 


/—I 7\ TV TV 7\ 

GACAAGAI G(j 


i LjH i U i I o 




PPA APA 


TPAPGGrP 


42 13 


i I C ACjAA i Lr i 






TPAAPP 


GGTCGTCA 


42__ia 


rn rn 7\ 7\ 7\ rn rn 

Tl LAbAAi bi 




TrtTPAf^A ATP 


TPAAPP 

i. ^ZTXiV^V^ .... 


GGTCGTCA 


42 4 


GACAAGATGG 


i b/il 1 1 *o 




PPA APA 




42 5a 


TTCAGAATGi 




^n.TT'iin.AA'vc 

i vj 1 v^riiVujzAri JL \^ 


TPAAPP 


PPTPGTCA 


4z I U 




1 o/i 1 C^ 1 o 1 o 


GG AGGAGG 


GCAAGA. . . . 


. . TGACGGCC 


42 3b 


GACAAGATGG 


TGATCTGGTG 


GG. .AGGAGG 


GCAAGA. . . . 


. . TGACGGCC 


42 11 


TTCAGAATGT 


TTCCCCGGCG 


TGTCAGAATC 


TCAACC .... 


. .GGTCGTCA 


42 6b 


TTCAGAATGT 


TTCCCCGGCG 


TGTCAGAATC 


TCAACC .... 


a . GGTCGTCA 


43 1 


CTCAGAATGT 


ttccccggtg 


CATCAGAATC 


TCAACC. . . . 


. . GGTCGTCA 


43 5 


CTCAGAATGT 


TTCCCCGGTG 


CATCAGAATC 


TCAACC .... 


a .GGTCGTCA 


43 12 


CTCAGAATGT 


ttccccggtg 


CATCAGAATC 


TCAACC 


. , GGTCGTCA 


43 20 


TTCAGAATGT 


TTCCCCGGCG 


TGTCAGAATC 


TOAACL .... 




43 21 


TTCAGAATGT 


TTCCCCGGCG 


TGTCAGAATC 


TCAACC .... 


a , bLjlOblCA 


43 23 


TTCAGAATGT 


TTCCCCGGCG 


TGTCAGAATC 


TCAACC .... 


. , bCjti C(j i UA 


43 '25 


TTCAGAATGT 


TTCCCCGGCG 


TGTCAGAATC 


TCAACC .... 


. . GGT^^Gl CA 


44 1 


TTCAGAATGT 


TTCCCCGGCG 


TGTCAGAATC 


TCAACC. . . . 


. . GblCGiCA 


44 5 


TTCAGAATGT 


TTCCCCGGCG 


TGTCAGAATC 


m /~i TV TV «^ 

TCAACC .... 


. . bbi 1 bi bA 


223 10 
223 2 
223 4 

223 5 
223 6 
223 7 
A3 4 






























































TTTGG7VATGC 


TTTCCCG . . . 


TGTCAGAATC 


TCAACCCGTT 


TCTGTCGTCA 


A3 5 


TTTGGAATGC 


TTTCCCG. . . 


TGTCAGAATC 


TCAACCCGTT 


CCTGTCGTCA 


A3 7 


TTTGGAATGC 


TTTCCCG. . . 


TGTCAGAATC 


TCAACCCGTT 


TCTGTCGTCA 


A3 3 


TTTGGAATGC 


TTTCCCG . . . 


TGTCAGAATC 


TCAACCCGTT 


TCTGTCGTCA 


42__12 


TTCAGAATGT 


TTCCCCGGCG 


TGTCAGAATC 


TCAACC .... 


, . GGTCGTCA 


AAVl 


TTCAGAGTGC 


TTCCCCGGCG 


TGTCAGAATC 


TCAACC. . . . 


. . GGTCGTCA 


AAV2 


TTTAGAGTGC 


TTTCCCG. . . 


TGTCAGAATC 


TCAACCCGTT 


TCTGTCGTCA 


AAV 3 


TGGGGAATGC 


TTCCCTGGAA 


TGTCAGAATC 


TCAACCCGTT 


TCTGTCGTCA 


AAV 8 


CTCAGAGTGT 


TTCCCCGGCG 


TGTCAGAATC 


TCAACC. . . . 


. . GGTCGTCA 


AAV 9 


CTCAGAGTGT 


TTCCCCGGCG 


TGTCAGAATC 


TCAACC .... 


a a GGTCGTCA 


AAV7 


TTTAGAGTGT 


TTCCCCGGCG 


TGTCAGAATC 


TCAACC .... 


. . GGTCGTCA 


44 2 


TTCAG7\ATGT 


TTCCCCGGCG 


TGTCAGAATC 


TCAACC .... 


a . GGTCGTCA 
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Fig. lAQ 



2101 2150 
GAAAGAGGAC GTATCGGAAA CTCTGTGCCA TTCATCATCT GCTGGGG . CG 



GAAAGAGGAC 
GAAAGAGGAC 
GAAAGAGGAC 
.AAGGTCGTG 
GAAAGAGGAC 
GAAAGAGGAC 
.AAGGTCGTG 
GAAAGAGGAC 
AA. . , .GGTC 

AA GGTC 

GAAAGAGGAC 
GAAAGAGGAC 
GAAAAAAAAC 
GAAAAAAAAC 
GAAAAAAAAC 
GAAAGAGGAC 
GAAAGAGGAC 
GAAAGAGGAC 
GAAAGAGGAC 
GAAAAAAGAC 
GAAAAAAGAC 



GTATCGGAAA 

GTATCGGAAA 
GTATCGGAAA 
GAGTCCGCCA 
GTATCGGAAA 
GTATCGGAAA 
GAGTCCGCCA 
GTATCGGAAA 
GTGAAGTCCG 
GTGGAGTCCG 
GTATCGGAAA 
GTATCGGAAA 
GTATCAGAAA 
GTATCAGAAA 
GTATCAGAAA 
GTATCGGAAA 
GTATCGGAAA 
GTATCGGAAA 
GTATCGGAAA 
GTATCGGAAA 
GTATCGGAAA 



CTCTGTGCCA 
CTCTGTGCCA 
CTCTGTGCCA 
AG ... . GCCA 
CTCTGTGCCA 
CTCTGTGCCA 
AG ... . GCCA 
CTCTGTGCCA 
CCAAG . GCCA 
CCAAG . GCCA 
CTCTGTGCCA 
CTCTGTGCCA 
CTCTGTGCCA 
CTGTGTGCCA 
CTGTGTGCCA 
CTCTGTGCCA 
CTCTGTGCCA 
CTCTGTGCCA 
CTCTGTGCCA 
CTCTGTGCCA 
CTCTGTGCCA 



TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 



GCTAGGG . CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 



GAAAAACG - . 
GAAAAACG . . 
GAAAAACG . . 
GAAAAACG . . 
GAAAGAGGAC 
GAAAGAGGAC 
AAAAGGCG . . 
AAAAGAAGAC 
GAAAGAGGAC 
GAAAGAGGAC 
GAAAAAAGAC 
GAAAAAAGAC 



. TATCAGAAA 
. TATCAGAAA 
.TATCAGAAA 
. TATCAGAAA 
GTATCGGAAA 
GTATCGGAAA 
. TATCAGAAA 
TTATCAGAAA 
GTATCGGAAA 
GTATCGGAAA 
GTATCGGAAA 
GTATCGGAAA 



CTTTGTTACA 
CTTTGTTACA 
CTTTGTTACA 
CTTTGTTACA 
CTCTGTGCCA 
CTCTGTGCCA 
CTGTGCTACA 
CTGTGTCCAA 
CTCTGTGCCA 
CTCTGTGCCA 
CTCTGCGCGA 
CTCTGTGCCA 



TTCATCATAT 
TTCATCATAT 
TTCATCATAT 
TTCATCATAT 
TTCATCATCT 
TTCATCATCT 
TTCATCATAT 
TTCATCATAT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 
TTCATCATCT 



CATGGGA. AA 
CATGGGA.AA 
CATGGGA. AA 
CATGGGA.AA 
GCTGGGG. CG 
GCTGGGG. CG 
CATGGGA.AA 
CCTGGGA. AG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGG. CG 
GCTGGGGGCG 
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Fig . lAR 



2151 

GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCACCCGAG 
GGCACCCGAG 
GGCACCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCACCCGAG 
GGCACCCGAG 



ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 



CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 



TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 



2200 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTAG 
GTGGACCTAG 



AGAACCAGAC ...GCCTGCA CTGCCTGCGA CCTGGTAAAT GTGGACTTGG 

AGTACCAGAC ...GCCTGCA CTGCCTGCGA CCTGGTAAAT GTGGACTTGG 

AGTACCAGAC ...GCCTGCA CTGCCTGCGA CCTGGTAAAT GTGGACTTGG 

AGTACCAGAC ...GCCTGCA CTGCCTGCGA CCTGGTAAAT GTGGACTTGG 

GGCTCCCGAG ATTGCTTGCT CGGCCTGCGA TCTGGTCAAC GTGGACCTGG 

GGCTCCCGAG ATTGCTTGCT CGGCCTGCGA TCTGGTCAAC GTGGACCTGG 

GGTGCCAGAC . . , GCTTGCA CTGCCTGCGA TCTGGTCAAT GTGGATTTGG 

GGCACCCGAG ATTGCCTGTT CGGCCTGCGA TTTGGCCAAT GTGGACTTGG 

GGCTCCCGAG ATTGCTTGCT CGGCCTGCGA TCTGGTCAAC GTGGACCTGG 

GGCTCCCGAG ATTGCTTGCT CGGCCTGCGA TCTGGTCAAC GTGGACCTGG 

GGCGCCCGAG ATTGCTTGCT CGGCCTGCGA CCTGGTCAAC GTGGACCTGG 

GGCACCCGAG ATTGCTTGCT CGGCCTGCGA TCTGGTCAAC GTGGACCTAG 
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Fig . IAS 



2201 



2250 



Rep 78 stop vpl start 

ATGACCGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAJ^TGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ACGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ACGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ACGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT A7y\CCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 



ATGACTGTAT TTCTGAGCAA TAAATGACTT AAATCAGGTA TGGCTGCTGA 

ATGACTGTAT TTCTGAGCAA TAAATGACTT AAATCAGGTA TGGCTGCTGA 

ATGACTGTAT TTCTGAGCAA TAAATGACTT AAATCAGGTA TGGCTGCTGA 

ATGACTGTAT TTCTGAGCAA TAAATGACTT AAATCAGGTA TGGCTGCTGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGCAT CTTTGAACAA TAAATGATTT AAATCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCTGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ACGACTGCGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGT A TG GCTGCCGA 

Rep78 stop vpl start 
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Fig . lAT 



2251 

TGGTTATCTT 
TGGTTATCTT 
TGGTTATCTT 
TGGTTATCTT 
TGGTTATCTT 
TGGTTATCTT 
TGGTCATCTT 
TGGTTATCTT 
TGGTTATCTT 
TGGTTATCTT 
TGGTTATCTT 
TGGTTATCTT 
TGGTTATCTT 
TGGTTATCTT 
TGGTTATCTT 
TGGTTATCTT 
TGGTTATCTT 
TGGTTATCTT 
TGGTTATCTT 
TGGTTATCTT 
TGGTTATCTT 
TGGTTATCTT 



2300 

Rep 68 stop 

CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TTGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TTGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TTGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 



CGGTTATCTT CCAGATTGGC TCGAGGACAC TCTCTCTGAA GGAATCAGAC 
CGGTTATCTT CCAGATTGGC TCGAGGACAC TCTCTCTGAA GGAi^iTCAGAC 
CGGTTATCTT CCAGATTGGC TCGAGGACAC TCTCTCTGAA GGAATCAGAC 
CGGTTATCTT CCAGATTGGC TCGAGGACAC TCTCTCTGAA GGAATCAGAC 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATCCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAC TCTCTCTGAA GGAATAAGAC 
CGGTTATCTT CCAGATTGGC TCGAGGACAA CCTTTCTGAA GGCATTCGTG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

Rep 68 stop 
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Fig. lAU 



2301 2350 
AGTGGTGGGA CTTGAAACCT GGAGCCCCGA AACCCAAAGC CAACCAGCAA 



AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 

AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 



CTTGAAACCT 

CTTGAAACCT 
CTTGAAACCT 
CTTGAGACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CCTGAAACCT 
CCTGAAACCT 
CCTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 



GGAGCCCCGA 

GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGZ-V 
GGAGCCCCGA 



AACCCAAAGC 

AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCATy^GC 
AACCCAAAGC 
T^ACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 



CAACCAGCAA 

CAACCAGC7\A 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 



AGTGGTGGAA 
AGTGGTGGAA 
AGTGGTGGAA 
AGTGGTGGAA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGAA 
AGTGGTGGGC 
AGTGGTGGGC 
AGTGGTGGGC 
AGTGGTGGGA 
AGTGGTGGGA 



CCTGAAACCT 
CCTGAAACCT 
GCTCAAACCT 
CCTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
GCTCAAACCT 
TCTGAAACCT 
CCTGAAACCT 
CCTGAAACCT 
CCTGAAACCT 
CTTGAAACCT 



GGCCCACCAC 
GGCCCACCAC 
GGCCCACCAC 
GGCCCACCAC 
GGAGCCCCGA 
GGAGCCCCGA 
GGCCCACCAC 
GGAGTCCCTC 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 



CGCCGAAACC 
CGCCGAAACC 
CGCCGAAACC 
CGCCGAAACC 
AACCCAAAGC 
AGCCCAAAGC 
CACC7VAAGCC 
AACCCAAAGC 
AGCCCAAAGC 
AGCCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 



TAACCAACAA 
TAACCAACAA 
TAACCAACAA 
TAACCAACAA 
CAACCAGCAA 
CAACCAGCAA 
CGCAGAGCGG 
GAACCAACAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
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2351 

AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 



ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 



Fig. lAV 



TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 



CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 



2400 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 



CACCGGGACG 
CACCGGGACG 
CACCGGGACG 
CACCGGGACG 
AAGCAGGACG 
AAGCAGGACG 
CATAAGGACG 
CACCAGGACA 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACA 
AAGCAGGACG 



ACAGTAGGGG 
ACAGTAGGGG 
ACAGTAGGGG 
ACAGTAGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACAGCAGGGG 
ACCGTCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 



TCTTGTGCTT 

TCTTGTGCTT 
TCTTGTGCTT 
TCTTGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTTGTGCTT 
TCTTGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 



CCTGGCTACA 
CCTGGGTACA 
CCTGGCTACA 
CCTGGGTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGGTACA 
CCGGGTTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 



AGTACCTCGG 

AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AATACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
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Fig. lAW 



2401 

ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 



GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 



AGGGAGAGCC 
AGGGGGAGCC 
AGGGGGAGCC 
AGGGAGAGCC 
AGGGAGAGCC 
AGGGGGAGCC 
AGGGGGAGCC 
AGGGAGAGCC 
AGGGAGAGCC 
AGGGAGAGCC 
AGGGAGAGCC 
AGGGAGAGCC 
AGGGAGAGCC 
AGGGGGAGCC 
AGGGGGAGCC 
AGGGGGAGCC 
AGGGGGAGCC 
AGGGGGAGCC 
AGGGGGAGCC 
AGGGGGAGCC 
AGGGGGAGCC 
AGGGGGAGCC 



GGTCAACGAG 
CGTCAACGCG 
CGTCAACGCG 
GGTCAACGAG 
GGTCAACGAG 
CGTCAACGCG 
CGTCAACGCG 
GGTCAACGAG 
GGTCAACGAG 
GGTCAACGAG 
GGTCAACGAG 
CGTCAACGCG 
GGTCAACGAG 
CGTCAACGCG 
CGTCAACGCG 
CGTCAACGCG 
CGTCAACGCG 
CGTCAACGCG 
CGTCAACGCG 
CGTCAACGCG 
CGTCAACGCG 
CGTCAACGCG 



2450 
GCAGACGCCG 
GCGGACGCAG 
GCGGACGCAG 
GCAGACGCCG 
GCAGACGCCG 
GCGGACGCAG 
GCGGACGCAG 
GCAGACGCCG 
GCAGACGCCG 
GCAGACGCCG 
GCAGACGCCG 
GCGGACGCAG 
GCAGACGCCG 
GCGGACGCAG 
GCGGACGCAG 
GCGGACGCAG 
GCGGACGCAG 
GCGGACGCAG 
GCGGACGCAG 
GCGGACGCAG 
GCGGACGCAG 
GCGGACGCAG 



ACCCTTCAAC 

ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCGGTAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 
ACCCTTCAAC 



GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 
GGACTCGACA 



AAGGAGAGCC 
AAGGAGAGCC 
AAGGAGAGCC 
7\AGGAGAGCC 
AGGGAGAGCC 
AGGGGGAGCC 
AGGGAGAGCC 
AAGGAGAGCC 
AGGGGGAGCC 
AGGGGGAGCC 
AGGGGGAGCC 
AGGGGGAGCC 



GGTCAACGAG 

GGTCAACGAG 
GGTCAACGAG 
GGTCAACGAG 
GGTCAACGAG 
CGTCAACGCG 
GGTCAACGAG 
GGTCAACGAG 
CGTCAACGCG 
CGTCAACGCG 
CGTCAACGCG 
CGTCAACGCG 



GCAGACGCCG 
GCAGACGCCG 
GCAGACGCCG 
GCAGACGCCG 
GCAGACGCCG 
GCGGACGCAG 
GCAGACGCCG 
GCGGACGCGG 
GCGGACGCAG 
GCGGACGCAG 
GCGGACGCAG 
GCGGACGCAG 
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2451 

42_2 CGGCCCTCGA GCACG.ACAA 

42_8 CGGCCCTCGA GCACG.ACAA 

4 2__15 CGGCCCTCGA GCACG.ACAA 

4 2__5b CGGCCCTCGA GCACG.ACAA 

4 2_lb CGGCCCTCGA GCACG.ACAA 

4 2__13 CGGCCCTCGA GCACG.ACAA 

4 2_3a CGGCCCTCGA GCACG.ACAA 

4 2_4 CGGCCCTCGA GCACG.ACAA 

4 2_5a CGGCCCTCGA GCACG.ACAA 

42__10 CGGCCCTCGA GCACG.ACAA 

42_3b CGGCCCTCGA GCACG.ACAA 

42_11 CGGCCCTCGA GCACG.ACAA 

4 2_6b CGGCCCTCGA GCACG.ACAA 

4 3__1 CGGCCCTCGA GCACG.ACAA 

4 3__5 CGGCCCTCGA GCACG.ACAA 

4 3_12 CGGCCCTCGA GCACG.ACAA 

4 3__2 0 CGGCCCTCGA GCACG.ACAA 

4 3_21 CGGCCCTCGA GCACG.ACAA 

4 3_2 3 CGGCCCTCGA GCACG.ACAA 

4 3_2 5 CGGCCCTCGA GCACG.ACAA 

4 4_1 CGGCCCTCGA GCACG.ACAA 

44 5 CGGCCCTCGA GCACG.ACAA 



223__10 CAA 

22 3_2 CAA 

22 3_4 CAA 

22 3_5 CAA 

22 3_6 CAA 

22 3 7 CAA 



A3_4 CGGCCCTCGA GCACG.ACAA 

A3_5 CGGCCCTCGA GCACG.ACAA 

A3_7 CGGCCCTCGA GCACG.ACAA 

A3__3 CGGCCCTCGA GCACG.ACAA 

42_12 CGGCCCTCGA GCACG.ACAA 

AAVl CGGCCCTCGA GCACG.ACAA 

AAV2 CGGCCCTCGA GCACGTACAA 

AAV3 CAGCCCTCGA ACACG.ACAA 

AAV8 CGGCCCTCGA GCACG.ACAA 

AAV9 CGGCCCTCGA GCACG.GCAA 

AAV7 CGGCCCTCGA GCACG.ACAA 

44 2 CGGCCCTCGA GCACG.ACAA 



Fig. lAX 

2500 

GGCCTACGAC AAGCAGCTCG AGCAGGGGGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
GGCCTACGAC AAGCAGCTCG AGCAGGGGGA 
GGCCTACGAC AAGCAGCTCG AGCAGGGGGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
GGCCTACGAC AAGCAGCTCG AGCAGGGGGA 
GGCCTACGAC AAGCAGCTCG AGCAGGGGGA 
GGCCTACGAC AAGCAGCTCG AGCAGGGGGA 
GGCCTACGAC AAGCAGCTCG AGCAGGGGGA 
GGCCTACGAC CAGCAGCTCA ^J^GCGGGTGA 
GGCCTACGAC AAGCAGCTCG AGCAGGGGGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
AGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
AGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
AGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
AGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
AGCCTACGAC CAGCAGCTCA AGCAAGGGGA 
AGCCTACGAC CAGCAGCTCA AGCAAGGGGA 
AGCCTACGAC CAGCAGCTCA AGCAAGGGGA 
AGCCTACGAC CAGCAGCTCA AGCAAGGGGA 
GGCCTACGAC AAGCAGCTCG AGCAGGGGGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
AGCCTACGAC CGGCAGCTCG ACAGCGGAGA 
AGCTTACGAC CAGCAGCTCA AGGCCGGTGA 
GGCCTACGAC CAGCAGCTGC AGGCGGGTGA 
GGCCTACGAC CAGCAGCTGC AGGCGGGTGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
GGCCTACGAC CAGCAGCTCA AAGCGGGTGA 
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CTCAAGTACA ACCACGCCGA 

CTGCGGTATA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTCAAGTACA ACCACGCCGA 
CTCAAGTACA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTCAAGTACA ACCACGCCGA 
CTCAAGTACA ACCACGCCGA 
CTCAAGTACA ACCACGCCGA 
CTCAAGTACA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTCAAGTACA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTGCGGTATA ATCACGCCGA 
CTGCGGTATA ATCACGCCGA 
CTGCGGTATA ATCACGCCGA 
CTGCGGTATA ATCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTCAAATACA ACCACGCGGA 
CTCAAATACA ACCACGCGGA 
CTCAAATACA ACCACGCGGA 
CTCAAATACA ACCACGCGGA 
CTCAAGTACA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTCAAGTACA ACCACGCCGA 
CTCAAGTACA ACCACGCCGA 
CTGCGGT/VTA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
CTGCGGTATA ACCACGCCGA 
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CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGTGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCTGAATTT CAGGAGCGTC 
CGCTGAATTT CAGGAGCGTC 
CGCTGAATTT CAGGAGCGTC 
CGCTGAATTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCGGAGTTT CAGGAGCGCC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
CGCCGAGTTT CAGGAGCGTC 
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TTCAAGAAGA 
TGCAAGAAGA 
TGCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TGCAAGAAGA 
TGCAAGAAGA 
TGCAAGAAGA 
TGCAAGAAGA 
TGCAAGAAGA 
TGCAAGAAGA 
TGCAAGAAGA 
TGCAAGAAGA 
TGCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TTCAAGAAGA 
TGCAAGAAGA 
TTAAAGAAGA 
TTCAAGAAGA 
TGCAAGAAGA 
TGCAAG7VAGA 
TGCAAGAAGA 
TGCAAGAAGA 



TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCCTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTC 
TACGTCTTTC 
TACGTCTTTC 
TACGTCTTTC 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCTTTT 
TACGTCATTT 
TACGTCTTTT 
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GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 
GGGGGCAACC 



TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGACGAGC 
TTGGCAGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
TCGGGCGAGC 
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AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCGG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
AGTCTTCCAG 
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GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCC7VAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAATIAGC 
GCCAAAAAGC 
GCCAAAAAGC 
GCCAAAAAGC 
GCCAAAAAGC 
GCCAAAAAGC 
GCCAAAAAGA 
GCCAAAAAGA 
GCCAAAAAGA 
GCCAAAAAGA 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAAAAGA 
GCCAAAAAGA 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 
GCCAAGAAGC 



GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTACTCGA 
GGGTACTCGA 
GGGTACTCGA 
GGGTACTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTTGA 
GGATCCTTGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
GGGTTCTCGA 
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ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTTGGT 
ACCTCTTGGT 
ACCTCTTGGT 
ACCTCTTGGT 
ACCTCTTGGT 
ACCTCTTGGT 
GCCTCTTGGT 
GCCTCTTGGT 
GCCTCTTGGT 
GCCTCTTGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTGGGC 
GCCTCTTGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 
ACCTCTCGGT 



CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGA 
CTGGTTGAGA 
CTGGTTGAGA 
CTGGTTGAGA 
CTGGTTGAGA 
CTGGTTGAGA 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
CTGGTTGAGG 
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AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAP. 
AAGGCGCTAA 
AAGGCGCTAA 
CGCCAGCTAA 
CGCCAGCTAA 
CGCCAGCTAA 
CGCCAGCTAA 
CGCCAGCTAA 
CGCCAGCTAA 
AAGCTGTTAA 
AAGCTGTTAA 
AAGCTGTTAA 
AAGCTGTTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGCTGTTAA 
AAGCAGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
AAGGCGCTAA 
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vp2 start 

4 2_2 GACGGCTCCT GGAAAGAAGA GACCCATAGA . . .ATCCCCC 

42_8 GACGGCTCCT GGAAAGAAGA GACCGGTAGA GCCATCACCC CAGCGTTCTC 

42^15 GACGGCTCCT GGAAAGAAGA GACCGGTAGA GCCATCACCC CAGCGTTCTC 

4 2_5b GACGGCTCCT GGAAAGAAGA GACCGGTAGA GCCATCACCC CAGCGTTCTC 

42_lb GACGGCTCCT GGAAAGAAGA GACCCATAGA . . . ATCCCCC 

4 2_13 GACGGCTCCT GGAAAGAAGA GACCCATAGA ...ATCCCCC 

42_3a GACGGCTCCT GGAAAGAAGA GACCCATAGA ...ATCCCCC 

42_4 GACGGCTCCT GGAAAGAAGA GACCCATAGA ...ATCCCCC 

4 2_5a GACGGCTCCT GGAAAGAAGA GACCCATAGA ...ATCCCCC 

42_10 GACGGCTCCT GGAAAGAAGA GACCCATAGA ...ATCCCCC 

4 2_3b GACGGCTCCT GGAAAGAAGA GACCCATAGA ...ATCCCCC 

42_11 GACGGCTCCT GGAAAGAAGA GACCCATAGA ...ATCCCCC 

42_6b GACGGCTCCT GGAAAGAAGA GACCGGTAGA GCCATCACCC CAGCGTTCTC 

4 3_1 GACGGCTCCT GGAAAGAAGA GACCGGTAGA GCCATCACCT CAGCGTTCCC 

4 3_5 GACGGCTCCT GGAAAGAAGA GACCGGTAGA GCCATCACCT CAGCGTTCCC 

4 3_12 GACGGCTCCT GGAAAGAAGA GACCGGTAGA GCCATCACCT CAGCGTTCCC 

4 3_20 GACGGCTCCT GGAAAGAAGA GACTGGTAGA GCAGTCGCCA CAAGAG . . . C 

4 3__21 GACGGCTCCT GGAAAGAAGA GACCGGTAGA GCAGTCGCCA CAAGAG ... C 

4 3^^2 3 GACGGCTCCT GGAAAGAAGA GACCGGTAGA GCAGTCGCCA CAAGAG ... C 

4 3_25 GACGGCTCCT GGAAAGAAGA GACCGGTAGA GCAGTCGCCA CAAGAG ... C 

44'_1 GACGGCTCCT GGAAAGAAGA GACCGGTAGA GCCATCACCC CAGCGTTCTC 

4 4~5 GACGGCTCCT GGAAAGAAGA GACCGGTAGA GCCATCACCC CAGCGTTCTC 

22 3_10 GACGGCACCT GGAAAGAAGC GACCGGTAGA CTCGCCA 

223_2 GACGGCACCT GGAAAGAAGC GACCGGTAGA CTCGCCA 

22 3_4 GACGGCACCT GGAAAGAAGC GACCGGTAGA CTCGCCA 

223_5 GACGGCACCT GGAAAGAAGC GACCGGTAGA CTCGCCA 

22 3_6 GACGGCACCT GGAAAGAAGC GACCGGTAGA CTCGCCA 

22 3__7 GACGGCACCT GGAAAGAAGC GACCGGTAGA CTCGCCA 

A3_4 GACGGCTCCT GGAAAAAAGA GACCTATAGA GCAGTCTCCT GCAGAA , . , C 

A3_5 GACGGCTCCT GGAA7\AAAGA GACCTATAGA GCAGTCTCCT GCAGAA...C 

A3__7 GACGGCTCCT GGAAAAAAGA GACCTATAGA GCAGTCTCCT GCAGAA...C 

A3_3 GACGGCTCCT GGAAA/^GA GACCTATAGA GCAGTCTCCT GCAGAA...C 

42__12 GACGGCTCCT GGAAAGAAGA GACCGGTAGA GCCATCACCC CAGCGTTCTC 

AAVl GACGGCTCCT GGAAAGAAAC GTCCGGTAGA GCAGTCGCCA CAAGAG... C 

AAV2 GACGGCTCCG GGAAAAAAGA GGCCGGTAGA GCAGTCTCCT GTGGAG...C 

AAV3 AACGGCTCCT GGAAAGAAGC GGGCTGTAGA TCAGTCTCCT CAGGAA...C 

AAV8 GACGGCTCCT GGAAAGAAGA GACCGGTAGA GCCATCACCC CAGCGTTCTC 

AAV9 GACGGCTCCT GGAAAGAAGA GACCGGTAGA GCCATCACCC CAGCGTTCTC 

AAV7 GACGGCTCCT GGAAAGAAGA GACCGGTAGA GCCGTCACCT CAGCGTTCCC 

44_2 GACGGCTCCT GGAAAGAAGA GACCGGTAGA GCCATCACCC CAGCGTTCTC 
vp2 start 
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CACGGGCATC GGCAAGAAAG 
TACGGGCATC GGCAAGACAG 
TACGGGCATC GGCAAGACAG 
TACGGGCATC GGCAAGACAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAGGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
TACGGGCATC GGCAAGACAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAG7\AAG 
CTCGGGCATC GGCAAGACAG 
CTCGGGCATC GGCAAGACAG 
CTCGGGCATC GGCAAGACAG 
CTCGGGCATC GGCAAGACAG 
TACGGGCATC GGCAAGAAAG 
TACGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAG7\AAG 
CTCGGGCATC GGCGAATCAG 
CTCGGGCATC GGCAAATCAG 
CTCGGGCATC GGCAAATCAG 
CTCGGGCATC GGCAAATCAG 
TACGGGCATC GGCAAGACAG 
CTCGGGCATC GGCAAGACAG 
CTCGGGAACC GGAAAGGCGG 
ATCTGGTGTT GGCAAATCGG 
TACGGGCATC GGCAAGAAAG 
TACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
TACGGGCATC GGCAAGAAAG 
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GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCT7VAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCCAGAAAG 
GCCAGCAGCC CGCGAGAAAG 
GCCAGCAGCC CGCGAGAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCTAAGAAA 
GCCAGCAGCC CGCTAAGAAA 
GCCAGCAGCC CGCTAAGAAA 
GCCAGCAGCC CGCTAAGi\AA 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC TGCAAGAAAA 
GCAAACAGCC TGCCAGAAAA 
GCC7y\CAGCC CGCCAGAAAA 
GCCAACAGCC CGCCAGAAAA 
GCCAGCAGCC CGCCAGAAAG 
GCCAGCAGCC CGCGAAAAAG 
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A3 4 
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A3 5 
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A3 7 
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A3 3 


AGACTCAATT 


42_12 


AGACTCAACT 


AAVl 


AGACTCAATT 


AAV2 


AGATTGAATT 


AAV 3 


AGACTAAATT 


AAV 8 


AGACTCAATT 


AAV 9 


AGACTCAATT 


AAV 7 


AGACTCAATT 


44 2 


AGACTCAACT 



Fig. lAAD 



TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCG 
TTGGGCAGAC TGGCGACTCG 
TTGGGCAGAC TGGCGACTCG 
TTGGTCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACACA 
TTGGTCAGAC TGGCGACACA 
TTGGTCAGAC TGGCGACACA 
TTGGTCAGAC TGGCGACACA 
TTGGGCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACTCA 
TTGGTCAGAC TGGAGACGCA 
TCGGTCAGAC TGGAGACTCA 
TTGGTCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACTCA 
TCGGTCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 



2800 

GAGTCAGTGC CCGACCCCCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCCCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCACA 
GAGTCAGTGC CCGACCCACA 
GAGTCAGTGC CCGACCCACA 
GAGTCAGTGC CCGACCCACA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGCCAGTCC CCGACCCTCA 
GAGCCAGTCC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CAGACCCTCA 
GAGTCAGTGC CAGACCCTCA 
GAGTCAGTGC CAGACCCTCA 
GAGTCAGTGC CAGGCCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGATCCACA 
GACTCAGTAC CTGACCCCCA 
GAGTCAGTGC CAGACCCTCA 
GAGTCAGTTC CAGACCCTCA 
GAGTCAGTTC CAGACCCTCA 
GAGTCAGTCC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
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2801 



2850 



vpSfa^start 



4 2_2 
42_8 
42_15 
42__5b 
42_lb 
42_13 
42__3a 
42_4 
42_5a 
42__10 
42_3b 
42__11 
42__6b 
4 3_1 
4 3_5 
43_12 
43^20 
4 3_21 
43_23 
4 3_25 
44_1 
44_5 
223__10 
22 3_2 
223_4 
223^5 
223__6 
223_7 
A3_4 
A3__5 
A3_7 
A3__3 
42_12 
AAVl 
AAV2 
AAV 3 
AAV 8 
AAV9 
AAV7 
44 2 



ACCTCTCGGA GAACCTCCCG CCGCGCCCTC AGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGCACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCTCTCGGA GAACCTCCCG CCGCGCCCTC AGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCTCTCGGA GAACCTCCAG CAGCCCCCTC AGGTCTGGGA CCTAATACAA 

ACCTCTCGGA GAACCTCCAG CAGCCCCCTC AGGTCTGGGA CCTAATACAA 

ACCTCTCGGA GAACCTCCAG CAGCCCCCTC AGGTCTGGGA CCTAJ^.TACAA 

ACCTCTCGGA GAACCTCCAG CAGCCCCCTC AGGTCTGGGA CCTAATACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGCCCCCTC TGGTCTGGGA TCTAATACAA 

ACCAATCGGA GAACCCCCCG CAGCCCCCTC TGGTCTGGGA TCTAATACAA 

ACCAATCGGA GAACCCCCCG CAGCCCCCTC TGGTCTGGGA TCTAATACAA 

ACCAATCGGA GAACCCCCCG CAGCCCCCTC TGGTCTGGGA TCTAATACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCTCTCGGA GAACCTCCAG CAACCCCCGC TGCTGTGGGA CCTACTACAA 

GCCTCTCGGA CAGCCACCAG CAGCCCCCTC TGGTCTGGGA ACTAATACGA 

ACCTCTCGGA GAACCACCAG CAGCCCCCAC AAGTTTGGGA TCTAATACAA 

ACCTCTCGGA GAACCTCCAG CAGCCCCCTC TGGTCTGGGA CCTAATACAA 

ACCTCTCGGA GAACCTCCAG CAGCCCCCTC TGGTGTGGGA CCTAATAC/^^ 

ACCTCTCGGA GAACCTCCAG CAGCCCCCTC TAGTGTGGGA TCTGGTACAG 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 




vp3 start 
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^p3 start codon 

42_2 TckcCTGCAGG CGGTGGCGCA 

42_8 TicCTGCAGG CGGTGGCGCT 

42_15 TGGCTGCAGG CGGTGGCGCT 

4 2_5b TGGCTGCAGG CGGTGGCGCT 

4 2_lb TGGCTGCAGG CGGTGGCGCT 

4 2_13 TGGCTGCAGG CGGTGGCGCT 

42_3a TGGCTGCAGG CGGTGGCGCT 

42_4 TGGCTGCAGG CGGTGGCGCT 

4 2_5a TGGCTGCAGG CGGTGGCGCA 

42_10 TGGCTGCAGG CGGTGGCGCT 

42_3b TGGCTGCAGG CGGTGGCGCT 

42_11 TGGCTGCAGG CGGTGGCGCT 

42_6b TGGCTGCAGG CGGTGGCGCT 

4 3__1 TGGCTGCAGG CGGTGGCGCT 

4 3_5 TGGCTGCAGG CGGTGGCGCT 

4 3_12 TGGCTGCAGG CGGTGGCGCT 

4 3_20 TGGCTTCAGG CGGTGGCGCT 

4 3_21 TGGCTTCAGG CGGTGGCGCT 

4 3__2 3 TGGCTTCAGG CGGTGGCGCT 

4 3__25 TGGCTTCAGG CGGTGGCGCT 

4 4_1 TGGCTGCAGG CGGTGGCGCT 

44_5 TGGCTGCAGG CGGTGGCGCT 

223_10 TGGCTGCAGG CGGTGGCGCA 

223_2 TGGTTGCAGG CGGTGGCGCA 

223_4 TGGCTGCAGG CGGTGGCGCA 

223_5 TGGCTGCAGG CGGTGGCGCA 

223_6 TGGCTGCAGG CGGTGGCGCA 

22 3_7 TGGCTGCAGG CGGTGGCGCA 

A3_4 TGGCTTCAGG CGGTGGCGCA 

A3__5 TGGCTTCAGG CGGTGGCGCA 

A3__7 TGGCTTCAGG CGGTGGCGCA 

A3__3 TGGCTTCAGG CGGTGGCGCA 

42_12 TGGCTGCAGG CGGTGGCGCT 

AAVl TGGCTTCAGG CGGTGGCGCA 

AAV2 TGGCTACAGG CAGTGGCGCA 

AAV3 TGGCTTCAGG CGGTGGCGCA 

AAV8 TGGCTGCAGG CGGTGGCGCA 

AAV9 TGGCTGCAGG CGGTGGCGCA 

AAV7 TGGCTGCAGG CGGTGGCGCA 

4 4__2 TGGCTGCAGG CGGTGGCGCT 
vp3 start codon (cont'c 



Fig. lAAF 

2900 

CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATMCGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAAT?^CGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATZ\ACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACA^.TAACGA AGGCGCCGAC 
CCAATGGCTG ACAATAACGA GGGCGCCGAC 
CCAATGGCTG ACAATAACGA GGGCGCCGAC 
CCAATGGCTG ACAATAACGA GGGCGCCGAC 
CCAATGGCTG ACAATAACGA GGGCGCCGAC 
CCAATGGCTG ACAATAGCGA GGGCGCCGAC 
CCAATGGCTG ACAATAACGA GGGCGCCGAC 
CCAATGGCAG ACGATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAAT7VACGA GGGCGCCGAC 
CCAATGGCAG ACAATAACGA GGGTGCCGAT 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CC7U\TGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGTGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
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2901 


42 2 


GGAGTGGGTA 


42 8 


GGAGTGGGTA 


42 15 


GGAGTGGGTA 


42 5b 


GGAGTGGGTA 


42 lb 


GGAGTGGGTA 


42 13 


GGAGTGGGTA 


42 3a 


GGAGTGGGTA 


42 4 


GGAGTGGGTA 


42 5a 


GGAGTGGGTA 


42 10 


GGAGTGGGTA 


42 3b 


GGAGTGGGTA 


42 11 


GGAGTGGGTA 


42 6b 


GGAGTGGGTA 


43 1 


GGAGTGGGTA 


43 5 


GGAGTGGGTA 


43 12 


GGAGTGGGTA 


43 20 


GGAGTGGGTA 


43 21 


GGAGTGGGTA 


43 23 


GGAGTGGGTA 


43 25 


GGAGTGGGTA 


44 1 


GGAGTGGGTA 


44 5 


GGAGTGGGTA 


223 10 


GGAGTGGGTA 


223 2 


GGAGTGGGTA 


223 4 


GGAGTGGGTA 


223 5 


GGAGTGGGTA 


223 6 


GGAGTGGGTA 


223 7 


GGAGTGGGTA 


A3 4 


GGAGTGGGTA 


A3 5 


GGAGTGGGTA 


A3 7 


GGAGTGGGTA 


A3 3 


GGAGTGGGTA 


42 12 


GGAGTGGGTA 


AAVl 


GGAGTGGGTA 


AAV2 


GGAGTGGGTA 


AAV3 


GGAGTGGGTA 


AAV 8 


GGAGTGGGTA 


AAV 9 


GGAGTGGGTA 


AAV7 


GGAGTGGGTA 


AAVIO 


GGTA 


AAVll 


GGTA 


AAVl 2 


GGTA 


44 2 


GGAGTGGGTA 



Fig. lAAG 



ATGCCTCCGG AAATTGGCAT 
GTTCCTCAGG AAATTGGCAT 
GTTCCTCAGG AAATTGGCAT 
GTTCCTCAGG AAATTGGCAT 
GTTCCTCAGG AAATTGGCAT 
GTTCCTCAGG AAATTGGCAT 
GTTCCTCAGG 7\AATTGGCAT 
ATGCCTCCGG AAATTGGCAT 
ATGCCTCCGG AAATTGGCAT 
ATGCCTCCGG AAATTGGCAT 
ATGCCTCCGG AAATTGGCAT 
ATGCCTCCGG AAATTGGCAT 
GTTCCTCAGG AAATTGGCAT 
GTTCCTCAGG AAATTGGCAT 
GTTCCTCAGG AAATTGGCAT 
GTTCCTCAGG AAATTGGCAT 
ATTCCTCGGG AAATTGGCAT 
ATTCCTCGGG AAATTGGCAT 
ATTCCTCGGG AAATTGGCAT 
ATTCCTCGGG AAATTGGCAT 
GTTCCTCAGG AAATTGGCAT 
GTTCCTCAGG AAATTGGCAT 
ATGCCTCAGG AAATTGGCAT 
ATGCCTCAGG AAATTGGCAT 
ATGCCTCAGG AAATTGGCAT 
ATGCCTCAGG AAATTGGCAT 
ATGCCTCAGG AAATTGGCAT 
ATGCCTCAGG AAATTGGCAT 
ATTCCTCGGG AAATTGGCAT 
ATTCCTCGGG AAATTGGCAT 
ATTCCTCGGG AT^ATTGGCAT 
ATTCCTCGGG AAATTGGCAT 
GTTCCTCAGG AAATTGGCAT 
ATGCCTCAGG AAATTGGCAT 
ATTCCTCGGG AAATTGGCAT 
ATTCCTCAGG AAATTGGCAT 
GTTCCTCGGG AAATTGGCAT 
ATTCCTCGGG AAATTGGCAT 
ATGCCTCAGG AAATTGGCAT 
ATTCCTCGGG AAATTGGCAT 
ATTCCTCGGG AAATTGGCAT 
ATTCCTCGGG AAATTGGCAT 
GTTCCTCAGG AAATTGGCAT 



2950 

TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATAGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CACGGCTGGG 
TGCGATTCCA CACGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGATGGG 
TGCGATTCCA CATGGATGGG 
TGCGATTCCA CATGGATGGG 
TGCGATTCCA CATGGATGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGATGGG 
TGCGATTCCC AATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
TGCGATTCCA CATGGCTGGG 
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42_2 
42_8 

42___15 
42_5b 
42_lb 
42_13 
42_3a 
4 2_4 
42__5a 
42_10 
42_3b 
42_11 
42_6b 
4 3_1 
4 3_5 
43_12 
4 3__20 
4 3__21 
43_23 
4 3_2 5 
44__1 
44_5 
223_10 
22 3_2 
223_4 
223__5 
223_6 
223_7 
A3_4 
A3__5 
A3__7 
A3_3 
42_12 
AAVl 
AAV2 
AAV3 
AAV8 
AAV9 
AAV 7 
AAV 10 
AAVll 
AAVl 2 
44 2 
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CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
GGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTT 
CGACAGAGTT 
CGACAGAGTT 
CGACAGAGTT 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
GGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACCGAGTC 
CGACAGAGTC 



ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATCACCACCA 
ATTACCACCA 
ATCACCACCA 
ATCACCACCA 
ATTACCACCA 
ATCACCACCA 



GCACCCGCAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGCAC 
GCACCCGCAC 
GCACCCGCAC 
GCACCCGCAC 
GCACCCGCAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACAAGAAC 
GCACAAGAAC 
GCACAAGAAC 
GCACAAGAAC 
GCACCCGAAC 
GCACCCGCAC 
GCACCCGAAC 
GCACCAGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGAAC 
GCACCCGCAC 
GCACCCGAAC 



CTGGGCCCTG 
CTGGGCCCTC 
CTGGGCCCTG 
CTGGGCCCTC 
CTGGGCCCTC 
CTGGGCCCTC 
CTGGGCCCTC 
CTGGGCCCTG 
CTGGGCCCTG 
CTGGGCCCTG 
CTGGGCCCTG 
CTGGGCCCTG 
CTGGGCCCTC 
CTGGGCCCTG 
CTGGGCCCTG 
CTGGGCCCTG 
CTGGGCCCTG 
CTGGGCCCTG 
CTGGGCCCTG 
CTGGGCCCTG 
CTGGGCCCTC 
CTGGGCCCTC 
CTGGGCCCTG 
CTGGGCCCTG 
CTGGGCCCTG 
CTGGGCCCTG 
CTGGGCCCTG 
CTGGGCCCTG 
CTGGGCCCTC 
CTGGGCCCTC 
CTGGGCCCTC 
CTGGGCCCTC 
CTGGGCCCTC 
CTGGGCCTTG 
CTGGGCCCTG 
CTGGGCCCTG 
CTGGGCCCTG 
CTGGGCATTG 
CTGGGCCCTG 
CTGGGTCCTG 
CTGGGCCCTG 
TTGGGCCCTG 
CTGGGCCCTC 



3000 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACTTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCCACCTACA 
CCAACCTACA 
CCCACCTACA 
CCCACCTACA 
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3001 


42 2 


TV TV TV *^/^ TV /~« /~im 

ACAACCACCT 


/I o o 


ACAACCACCT 


/I O "1 c 


TV / — ( TV 7\ TV /~» / — I m 

ACAACCACC I 
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H. Z OD 


ACAACCACC 1 


ft Z ID 


ACAACCACC 1 


4 Z X o 


ACAACCACC 1 


4 z 


ACAACCACC 1 


4 Z 4 


ACAACCACC 1 


4 z oa 
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4 Z 1 U 
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4 o 1 Z 
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4 o Z U 
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4 o Z 1 
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4 O Z O 
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A '^O c; 




AA 1 


A PA AP pa PPT 


4 4*""^ 




223 To 


ACAACCACCT 


223^ 2 


ACAACCACCT 


223~"4 


ACAACCACCT 


223~5 


ACAACCACCT 


223^6 


ACAACCACCT 


223~7 


ACAACCACCT 


A3"*4 


ATAATCACCT 


A3~5 


ATAATCArPT 


A3~7 


ATAATCGCCT 


A3 3 


ATAATCACCT 


42 12 


ACAACCACCT 


AAVl 


ATAACCACCT 


AAV2 


ACAACCACCT 


AAV3 


ACAACCATCT 


AAV 8 


ACAACCACCT 


AAV9 


ACAACCACCT 


AAV7 


ACAACCACCT 


AAVIO 


ACAACCACAT 


AAVll 


ACAACCACCT 


AAVl 2 


ACAACCACCT 


44 2 


ACAACCACCT 



CTACAAGCAG ATATCAA..G 

CTACAAGCAA ATCTCCAACG 
CTACAAGCAA ATCTCCAACG 
CTACAAGCAA ATCTCCAACG 
CTACAAGCAA ATCTCCAACG 
CTACAAGCAA ATCTCCAACG 
CTACAAGCAA ATCTCCAACG 
CTACAAGCAG ATATCAA. . . 
CTACAAGCAG ATATCAA. . . 
CTACAAGCAG ATATCAA G 
CTACAAGCAG ATATCAA..G 
CTACAAGCAG ATATCAA . . G 
CTACAAGCAA ATCTCCAACG 
CTACAAGCAA ATCTCCAACG 
CTACAAGCAA ATCTCC/^CG 
CTACAAGCAA ATCTCCAACG 
CTACAAGCAA ATCTCCAACG 
CTACAAGCAA ATCTCCAACG 
CTACAAGCAA ATCTCCAACG 
CTACAAGCAA ATCTCCAACG 
CTACAAGCAA ATCTCCAACG 
CTACAAGCAA ATCTCCAACG 
CTACAAGCAA ATCTCCAGTC 
CTACAAGCAA ATCTCCAGTC 
CTACAAGCAA ATCTCCAGTC 
CTACAAGCAA ATCTCCAGTC 
CTACAAGCAA ATCTCCAGTC 
CTACAAGCAA ATCTCCAGTC 
CTACAAGCAA ATCTCCA , . . 
CTACAAGCAA ATCTCCA. . . 
CTACAAGCAA ATCTCCA. . . 
CTACAAGCAA ATCTCCA. . . 
CTAC7VAGCAA ATCTCCZVACG 
CTACAAGCAA ATCTCCAGTC 
CTACATIACAA ATTTCCA. , . 
CTACAAGCAA ATCTCCA. . . 
CTACAAGCAA ATCTCCAACG 
CTACAAGCAA ATCTCCAATG 
CTACAAGCAA ATCTCCAGTG 
CTACAAGCAA ATCTCCAGCG 
CTACAAACAA ATCTCCAGCG 
CTACAAGCAA ATCTCCAGCG 
CTACAAGCAA ATCTCCAACG 
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TCAGAGCGGG GCT. . . .ACC 

GGACATCGGG AGGAAGCACC 
GGACATCGGG AGGAAGCACC 
GGACATCGGG AGGAAGCACC 
GGACATCGGG AGGAAGCACC 
GGACATCGGG AGGAAGCACC 
GGACATCGGG AGGAAGCACC 
. GTCAGAGCG GGGC . . TACC 
. GTCAGAGCG GGGC . . TACC 
TCAGAGCGGG GCTA. . . . CC 
TCAGAGCGGG GCTA. . , . CC 
TCAGAGCGGG GCTA. . . . CC 
GGACATCGGG AGGAAGCACC 
GGACATCGGG AGGAAGCACT 
GGACATCGGG AGGAAGCACT 
GGACATCGGG AGGAAGCACT 
GCACCTCGGG AGGAAGCACC 
GCACCTCGGG AGGAAGCACC 
GCACCTCGGG AGGAAGCACC 
GCACCTCGGG AGGAAGCACC 
GGACTTCGGG AGGAAGCACC 
GGACTTCGGG AGGAAGCACC 
AGTCAGCAGG GAG . . . CACC 
AGTCAGCAGG GAG . . . CACC 
AGTCAGCAGG GAG . . . CACC 
AGTCAGCAGG GAG . . . CACC 
AGTCAGCAGG GAG . . . CACC 
AGTCAGCAGG GAG . . . CACC 
GCGAATCGGG AGC . . . CACC 
GCGAATCGGG AGC . . . CACC 
GCGAATCGGG AGC . . . CACC 
GCGAATCGGG AGC . . . CACC 
GGACATCGGG AGGAAGCACC 
CTTCAACGGG . GG . . CCAGC 
GCCAATCAGG AGC. , . CTCG 
GCCAATCAGG AGC. , . TTCA 
GGACATCGGG AGGAGCCACC 
GAACATCGGG AGGAAGCACC 
AAACTGCAGG TAG. . .TACC 
AGACAGGAGC CACCAACGAC 
CTTCAACGGG GGCCAGCAAC 
AATCGGGTGC CACCAACGAC 
GGACTTCGGG AGGAAGCACC 
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3051 

42__2 AACGACAACC ACTTCTTCGG CTACAGCACC 

4 2_8 AACGACAACA CCTACTTCGG CTACAGCACC 

42__15 AACGACAACA CCTACTTCGG CTACAGCACC 

42__5b AACGACAACA CCTACTTCGG CTACAGCACC 

4 2__lb AACGACAACA CCTACTTCGG CTACAGCACC 

42_13 AACGACAACA CCTACTTCGG CTACAGCACC 

42_3a AACGACAACA CCTACTTCGG CTACAGCACC 

4 2_4 AACGACAACC ACTTCTTCGG CTACAGCACC 

42__5a AACGACAACC ACTTCTTCGG CTACAGCACC 

42_10 AACGACAACC ACTTCTTCGG CTACAGCACC 

4 2_3b AACGACAACC ACTTCTTCGG CTACAGCACC 

42_11 AACGACAACC ACTTCTTCGG CTACAGCACC 

42_6b AACGACAACA CCTACTTCGG CTACAGCACC 

4 3_1 AACGACAACA CCTACTTTGG CTACAGCACC 

4 3_5 AACGACAACA CCTACTTTGG CTACAGCACC 

43__12 AACGACAACA CCTACTTTGG CTACAGCACC 

4 3_20 AACGACAACA CCTATTTTGG CTACAGCACC 

4 3_21 AACGACAACA CCTATTTTGG CTACAGCACC 

4 3_2 3 AACGACAACA CCTATTTTGG CTACAGCACC 

43__25 AACGACAACA CCTATTTTGG CTACAGCACC 

4 4__1 AACGACAACA CCTACTTCGG CTACAGCACC 

4 4_5 AACGACAACA CCTACTTCGG CTACAGCACC 

223_10 AACGATAACG TCTATTTCGG CTACAGCACC 

223__2 AACGATAACG TCTATTTCGG CTACAGCACC 

22 3_4 AACGATAACG TCTATTTCGG CTACAGCACC 

223_5 AACGATAACG TCTATTTCGG CTACAGCACC 

223_6 AACGATAACG TCTATTTCGG CTACAGCACC 

22 3_7 AACGATAACG TCTATTTCGG CTACAGCACC 

A3~4 AACGACAACC ACTACTTCGG CTACAGCACC 

A3__5 AACGACAACC ACTACTTCGG CTACAGCACC 

A3_7 AACGACAACC ACTACTTCGG CTACAGCACC 

A3_3 AACGAC7y\CC ACTACTTCGG CTACAGCACC 

42_12 AACGACAACA CCTACTTCGG CTACAGCACC 

AAVl AACGACAACC ACTACTTCGG CTACAGCACC 

AAV2 AACGACAATC ACTACTTTGG CTACAGCACC 

AAV3 AACGACAACC ACTACTTTGG CTACAGCACC 

AAV8 AACGACAACA CCTACTTCGG CTACAGCACC 

AAV9 AACGACAACA CCTACTTTGG CTACAGCACC 

AAV7 AACGACAACA CCTACTTCGG CTACAGCACC 

AAVIO AACCACTACT TCGGCTACAG C ACC 

AAVll ...GACAACC ACTACTTTGG CTACAGCACC 

AAV12 AACCACTACT TCGGCTA CAGCACC 

44 2 AACGACAACA CCTACTTCGG CTACAGCACC 



3100 

CCCTGGGGCT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGCT ATTTTGACTT 
CCCTGGGGCT ATTTTGACTT 
CCCTGGGGCT ATTTTGACTT 
CCCTGGGGCT ATTTTGACTT 
CCCTGGGGCT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGATTT 
CCTTGGGGGT ATTTTGACTT 
CCTTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCCTGGGGGT ATTTTGACTT 
CCTTGGGGGT ATTTTGATTT 
CCCTGGGGGT ATTTTGACTT 
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4 3~25 
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44^ 1 
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223 10 


CAACAGATTC 


223 2 


CAACAGATTC 


223~4 


CAACAGATTC 


223~'5 


CAACAGATTC 


223~6 


CAACAGATTC 


223^7 


CAACAGATTC 


A3~4 


TAACAGATTC 


A3"~5 


TAACAGATTC 


A3 7 


TAACAGATTC 


A3~3 


TAACAGATTC 


42_12 


TAACAGATTC 


AAVl 


CAACAGATTC 


AAV2 


CAACAGATTC 


AAV3 


TAACAGATTC 


AAV 8 


TAACAGATTC 


AAV 9 


CAACAGATTC 


AAV7 


TAACAGATTC 


AAVIO 


TAACAGATTC 


AAVll 


TAACAGATTC 


AAVl 2 


CAACAGATTC 


44 2 


TAACAGATTC 



Fig. lAAK 



CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCATCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGTCACT TTTCACCACG 
CACTGTCACT TTTCACCACG 
CACTGTCACT TTTCACCACG 
CACTGTCACT TTTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CATTGCCACT TCTCACCACG 
CATTGCCACT TCTCACCACG 
CATTGCCACT TCTCACCACG 
CATTGCCACT TCTCACCACG 
CATTGCCACT TCTCACCACG 
CATTGCCACT TCTCACCACG 
CACTGTCACT TCTCACCACG 
CACTGTCACT TCTCACCACG 
CACTGTCACT TCTCACCACG 
CACTGTCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TTTCACCACG 
CACTGCCACT TTTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TTTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TTTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCATT TCTCACCACG 
CACTGCCACT TCTCACCACG 



3150 

TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAA CGACTCATCA 
TGACTGGCAA CGACTCATCA 
TGACTGGCAA CGACTCATCA 
TGACTGGCAA CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTTATCA 
TGACTGGCAG CGACTTATCA 
TGACTGGCAG CGACTTATCA 
TGACTGGCAG CGACTTATCA 
TGACTGGCAG CGACTTATCA 
TGACTGGCAG CGACTTATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAA AGACTCATCA 
TGACTGGCAG CGACTCATTA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
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43 1 
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ACAATAACTG 
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ACAACAATTG 
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ACAACAATTG 


44 1 


ACAACAACTG 
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223 10 


ACAACAACTG 


223 2 


ACAACAACTG 


223 4 


ACAACAACTG 


223 5 


ACAACAACTG 


223 6 


ACAACAACTG 


223 7 


ACAACAACTG 


A3 4 


ACAACAACTG 


A3 5 


ATAACAACTG 


A3 7 


ACAACAACTG 


A3 3 


ACAACAACTG 


42__12 


ACAACAACTG 


AAVl 


ACAACAATTG 


AAV2 


ACAACAACTG 


AAV 3 


ACAACAACTG 


AAV8 


ACAACAACTG 


AAV 9 


ACAACAACTG 


AAV7 


ACAACAACTG 


AAVIO 


ACAACAACTG 


AAVll 


ACAACAACTG 


AAVl 2 


ACAACAACTG 


44 2 


ACAACAACTG 



Fig. lAAL 



GGGATTCCGG CCCAGAAAGC 
GGGATTCCGG CCCAAGAGAC 
GGGATTCCGG CCCAAGAGAC 
GGGATTCCGG CCCAAGAGAC 
GGGATTCCGG CCCAAGAGAC 
GGGATTCCGG CCCAAGAGAC 
GGGATTCCGG CCCAAGAGAC 
GGGATTCCGG CCCAAGAGAC 
GGGATTCCGG CCCAGAAAGC 
GGGATTCCGG CCCAGAAAGC 
GGGATTCCGG CCCAGAAAGC 
GGGATTCCGG CCCAGAAAGC 
GGGATTCCGG CCCAGAAAGC 
GGGATTCCGG CCCAAGAGAC 
GGGATTCCGG CCCAAGAGAC 
GGGATTCCGG CCCAAGAGAC 
GGGATTCCGG CCCAAAAGAC 
GGGATTCCGG CCCAAAAGAC 
GGGATTCCGG CCCAAAAGAC 
GGGATTCCGG CCCAAAAGAC 
GGGATTCCGG CCCAAGAGAC 
GGGATTCCGG CCCAAGAGAC 
GGGATTCCGG CCCAAGAAGC 
GGGATTCCGG CCCAAGAAGC 
GGGATTCCGG CCCAAGAAGC 
GGGATTCCGG CCCAAGAAGC 
GGGATTCCGG CCCAAGAAGC 
GGGATTCCGG CCCAAGAAGC 
GGGATTTAGA CCCAAGAAAC 
GGGATTTAGA CCCAAGAAAC 
GGGATTTAGA CCCAAGAAAC 
GGGATTTAGA CCCAAGAAAC 
GGGATTCCGG CCCAAGAGAC 
GGGATTCCGG CCCAAGAGAC 
GGGATTCCGA CCCAAGAGAC 
GGGATTCCGG CCCAAGAAAC 
GGGATTCCGG CCCAAGAGAC 
GGGATTCCGG CCAAAGAGAC 
GGGATTCCGG CCCAAGAAGC 
GGGATTC 
GGGATTC 
GGGATTC 

GGGATTCCGG CCCAAGAGAC 



3200 

TGCGGTTCAA GTTGTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TGCGGTTCAA GTTGTTCAAC 
TGCGGTTCAA GTTGTTCAAC 
TGCGGTTCAA GTTGTTCAAC 
TGCGGTTCAA GTTGTTCAAC 
TGCGGTTCAA GTTGTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAACTTCAA GCTGTTCAAC 
TCAACTTCAA GCTCTTCAAC 
CCAACTTCAA GCTCTTCAAC 
TC7\ACTTCAA GCTCTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAATTTC7\A GCTCTTCAAC 
TCAATTTCAA GCTCTTCAAC 
TCAATTTCAA GCTCTTCAAC 
TCAATTTCAA GCTCTTCAAC 
TCAACTTCAA GCTCTTCAAC 
TCAACTTCAA ACTCTTCAAC 
TCAACTTCAA GCTCTTTAAC 
TCAGCTTCAA GCTCTTCAAC 
TCAGCTTCAA GCTCTTCAAC 
TCAACTTCAA GCTGTTCAAC 
TGCGGTTCAA GCTCTTCAAC 



TCAACTTCAA GCTCTTCAAC 
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A3 4 
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A3 7 
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ATCCAAGTCA 


42_12 


ATCCAGGTCA 


AAVl 


ATCC7U\GTCA 
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AAV 3 


ATCCAAGTTA 


AAV 8 


ATCCAGGTCA 


AAV 9 


ATCCAGGTCA 


AAV 7 


ATCCAGGTCA 


44 2 


ATCCAGGTCA 



Fig. lAAM 



AGGAGGTCAC GACGAACGAC 

AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GACGAACGAC 
AGGAGGTCAC GACGAACGAC 
AGGAGGTCAC GACGAACGAC 
AGGAGGTCAC GACGAACGAC 
AGGAGGTCAC GACGGACGAC 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAAGTCAC GACGAACGAA 
AGGAAGTCAC GACGAACGAA 
AGGAAGTCAC GACGAACGAA 
AGGAAGTCAC GACGAACGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAG7VATGAA 
AGGAGGTCAC GACGAATGAC 
AGGAGGTCAC GACGAATGAC 
AGGAGGTCAC GACGAATGAC 
AGGAGGTCAC GACGAATGAC 
AGGAGGTCAC GACGAATGAC 
AGGAGGTCAC GACGAATGAC 
AGGAGGTCAC GCAGAATGAT 
AGGAGGTCAC GCAGAATGAT 
AGGAGGTCAC GCAGAATGAT 
AGGAGGTCAC GCAGAATGAT 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GACGAATGAT 
AAGAGGTCAC GCAGT^ITGAC 
GAGGGGTCAC GCAGAACGAT 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTTAC GACGAACGAA 
AGGAGGTCAC GACGAATGAC 
AGGAGGTCAC GCAGAATGAA 



3250 

GGCGTTACGA CCATCGCTAA 

GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACC7y\GA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCGTTACGA CCATCGCTAA 
GGCGTTACGA CCATCGCCAA 
GGCGTTACGA CCATCGCTAA 
GGCGTTACGA CCATCGCTAA 
GGCGTTACGA CCATCGCTAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCTU^GA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGTGTCACAA CCATCGCT7VA 
GGTGTCACAA CCATCGCTAA 
GGCGTCACAA CCATCGCTAA 
GGCGTCACAA CCATCGCTAA 
GGTGTCACAA CCATCGCTAA 
GGCGTCACAA CCATCGCTAA 
GGAACCACGA CCATCGCCAA 
GGAACCACGA CCATCGCCAA 
GGAACCACGA CCATCGCCAA 
GGAACCACGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCGTCACAA CCATCGCTAA 
GGTACGACGA CGATTGCCAA 
GGCACGACGA CTATTGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCGTTACGA CCATCGCTAA 
GGCACCAAGA CCATCGCCAA 
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A3 3 
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AAVl 
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Fig. lAAN 



AGCACGATTC AGGTCTTCTC 
AGCACGATTC AGGTCTTTAC 
AGCACGATTC AGGTCTTTAC 
AGCACGATTC AGGTCTTTAC 
AGCACGATTC AGGTCTTTAC 
AGCACGATTC AGGTCTTTAC 
AGCACGATTC AGGTCTTTAC 
AGCACGATTC AGGTCTTTAC 
AGCACGATTC AGGTCTTCTC 
AGCACGATTC AGGTCTTCTC 
AGCACGATTC AGGTCTTCTC 
AGCACGATTC AGGTCTTCTC 
AGCACGATTC AGGTCTTCTC 
AGCACGATTC AGGTGTTTAC 
AGCACGATTC AGGTGTTTAC 
AGCACGATTC AGGTGTTTAC 
AGCACCGTGC AGGTCTTTAC 
AGCACCGTGC GGGTCTTTAC 
AGCACCGTGC AGGTCTTTAC 
AGCACCGTGC AGGTCTTTAC 
AGCACGATTC AGGTCTTTAC 
AGCACGATTC AGGTCTTTAC 
AGCACGGTTC AGGTCTTTTC 
AGCACGGTTC AGGTCTTTTC 
AGCACGGTTC AGGTCTTTTC 
AGCACGGTTC AGGTCTTTTC 
AGCACGGTTC AGGTCTTTTC 
AGCACGGTTC AGGTCTTTTC 
AGCACGGTGC AGGTCTTCAC 
AGCACGGTGC AGGTCTTCAC 
AGCACGGTGC AGGTCTTCAC 
AGCGCGGTGC AGGTCTTCAC 
AGCACGATTC AGGTCTTTAC 
AGCACGGTTC AAGTCTTCTC 
AGCACGGTTC AGGTGTTTAC 
AGCACGGTTC AAGTGTTTAC 
AGCACCATCC AGGTGTTTAC 
AGCACCGTGC AGGTCTTTAC 
AGCACGATTC AGGTATTCTC 
AGCACGATTC AGGTCTTTAC 



3300 

GGACTCGGAG TACCAACTGC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCGGCTCC 
GGACTCGGAG TACCAACTGC 
GGACTCGGAG TACCAACTGC 
GGACTCGGAG TACCAACTGC 
GGACTCGGAG TACCAACTGC 
GGACTCGGAG TACCAACTGC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAG TACCAGTTAC 
GGACTCGGAG TACCAGTTAC 
GGACTTGGAG TACCAGTTAC 
GGACTCGGAG TACCAGTTAC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TATCAACTGC 
GGACTCGGAA TATCAACTGC 
GGACTCGGAA TATCAACTGC 
GGACTCGGAA TATCAACTGC 
GGACTCGGAA TATCAACTGC 
GGACCCGGAA TATCAACTGC 
AGACTCTGAG TACCAGCTCC 
AGACTCTGAG TACCAGCTCC 
AGACTCTGAG TACCAGCTCC 
AGACTCTGAG TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAG TACCAGCTTC 
TGACTCGGAG TACCAGCTCC 
GGACTCGGAG TATCAGCTCC 
GGACTCGGAG TACCAGCTCC 
GGACTCGGAG TACCAGCTAC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
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3301 

CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCC 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTGCT 
CGTACGTCCT 
CGTACGTGCT 
CGTACGTGCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTGCT 
CGTACGTTCT 
CGTACGTCCT 
CGTACGTCCT 
CGTACGTCCT 



CGGCTCTGCG 
CGGCTCTGCG 
CGGCTCTGCG 
CGGCTCTGCG 
CGGCTCTGCG 
CGGCTCTGCG 
CGGCTCTGCG 
CGGCTCTGCG 
CGGCTCTGCG 
CGGCTCTGCG 
CGGCTCTGCG 
CGGCTCTGCG 
CGGCTCTGCG 
CGGCTCTGCG 
CGGCTCTGCG 
CGGCTCTGCG 
AGGATCCGCT 
AGGATCCGCT 
AGGATCCGCT 
AGGATCCGCT 
CGGCTCTGCG 
CGGCTCTGCG 
CGGCTCCGCG 
CGGCTCCGCG 
CGGCTCCGCG 
CGGCTCCGCG 
CGGCTCCGCG 
CGGCTCCGCG 
CGGTTCGGCT 
CGGTTCGGCT 
CGGTTCGGCT 
CGGTTCGGCT 
CGGCTCTGCG 
CGGCTCTGCG 
CGGCTCCGCG 
CGGGTCGGCG 
CGGCTCTGCG 
AGGCTCTGCC 
CGGCTCTGCG 
CGGCTCTGCG 



Fig . lAAO 



CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGAT 
CACCAGGGAT 
CACCAGGGAT 
CACCAGGGAT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CACCAGGGCT 
CATCAAGGAT 
CACCAAGGCT 
CACCAGGGCT 
CACCAAGGAT 
CACCAGGGCT 
CACCAGGGCT 



GCCTCCCTCC 
GCCTGCCTCC 
GCCCGCCTCC 
GCCTGCCTCC 
GCCTGCCTCC 
GCCTGCCTCC 
GCCTGCCTCC 
GCCTGCCTCC 
GCCTCCCTCC 
GCCTCCCTCC 
GCCTCCCTCC 
GCCTCCCTCC 
GCCTCCCTCC 
GCCTCCCTCC 
GCCTCCCTCC 
GCCTCCCTCC 
GTCTGCCTCC 
GTCTGCCTCC 
GTCTGCCTCC 
GTCTGCCTCC 
GCCTGCCTCC 
GCCTGCCTCC 
GCCTGCCTCC 
GCCTGCCTCC 
GCCTGCCTCC 
GCCTGCCTCC 
GCCTGCCTCC 
GCCTGCCTCC 
GCCTTCCGCC 
GCCTTCCGCC 
GCCTTCCGCC 
GCCTTCCGCC 
GCCTGCCTCC 
GCCTCCCTCC 
GCCTCCCGCC 
GTCTCCCGCC 
GCCTGCCTCC 
GCCTGCCACC 
GCCTGCCTCC 
GCCTGCCTCC 
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GTTCCCTGCG 
GTTCCCGGCG 
GTTCCCGGCG 
GTTCCCGGCG 
GTTCCCGGCG 
GTTCCCGGCG 
GTTCCCGGCG 
GTTCCCGGCG 
GTTCCCTGCG 
GTTCCCTGCG 
GTTCCCTGCG 
GTTCCCTGCG 
GTTCCCTGCG 
GTTCCCGGCG 
GTTCCCGGCG 
GTTCCCGGCG 
GTTCCCGGCG 
GTTCCCGGCG 
GTTCCCGGCG 
GTTCCCGGCG 
GTTCCCGGCG 
GTTCCCGGCG 
GTTCCCGGCA 
GTTCCCGGCA 
GTTCCCGGCA 
GTTCCCGGCA 
GTTCCCGGCA 
GTTCCCGGCA 
GTTCCCAGCA 
GTTCCCAGCA 
GTTCCCAGCA 
GTTCCCAGCA 
GTTCCCGGCG 
GTTCCCGGCG 
GTTCCCAGCA 
GTTTCCAGCG 
GTTCCCGGCG 
GTTTCCTGCA 
GTTCCCGGCG 
GTTCCCGGCG 
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3351 


42 2 


GACGTGTTCA 


42 8 


GACGTCTTCA 


42 15 


GACGTGTTCA 


42 5b 


GACGTCTTCA 


42 lb 


GACGTCTTCA 


42 13 


GACGTCTTCA 


42 3a 


GACGTCTTCA 


42 4 


GACGTCTTCA 


42 5a 


GACGTGTTCA 


42 10 


GACGTGTTCA 


42 3b 


GACGTGTTCA 


42 1 


GACGTGTTCA 


42 6b 


GACGTGTTCA 


43 1 


GACGTCTTCA 


43 5 


GACGTCTTCA 


43 12 


GACGTCTTCA 


43 20 


GACGTCTTCA 


43 21 


GACGTCTTCA 


43 23 


GACGTCTTCA 


43 25 


GACGTCTTCA 


44 1 


GACGTCTTCA 


44 5 


GACGTCTTCA 


223 10 


GACGTGTTCA 


223" 2 


GACGTGTTCA 


223 4 


GACGTGTTCA 


223 5 


GACGTGTTCA 


223 6 


GACGTGTTCA 


223 7 


GACGTGTTCA 


A3 4 


GACGTCTTCA 


A3 5 


GACGTCTTCA 


A3 7 


GACGTCTTCA 


A3 3 


GACGTCTTCA 


42^12 


GACGTCTTCA 


AAVl 


GACGTGTTCA 


AAV2 


GACGTCTTCA 


AAV 3 


GACGTCTTCA 


AAV8 


GACGTGTTCA 


AAV 9 


GACGTCTTCA 


AAV7 


GACGTCTTCA 


44 2 


GACGTCTTCA 



Fig, lAAP 



TGATTCCTCA GTACGGATAT 
TGATTCCTCA GTACGGGTAC 
TGATTCCTCA GTACGGGTAC 
TGATTCCTCA GTACGGGTAC 
TGATTCCTCA GTACGGGTAC 
TGATTCCTCA GTACGGGTAC 
TGATTCCTCA GTACGGGTAC 
TGATTCCTCA GTACGGGTAC 
TGATTCCTCA GTACGGATAT 
TGATTCCTCA GTACGGATAT 
TGATTCCTCA GTACGGATAT 
TGATTCCTCA GTACGGATAT 
TGATTCCTCA GTACGGATAT 
TGATTCCTCA GTACGGGTAT 
TGATTCCTCA GTACGGGTAT 
TGATTCCTCA GTACGGGTAT 
CGGTTCCTCA GTACGGGTAT 
TGGTTCCTCA GTACGGGTAT 
TGGTTCCTCA GTACGGGTAT 
TGGTTCCTCA GTACGGGTAT 
TGATTCCTCA GTACGGGTAC 
TGATTCCTCA GTACGGGTAC 
TGATTCCGCA GTACGGATAC 
TGATTCCGCA GTACGGATAC 
TGATTCCGCA GTACGGATAC 
TGATTCCGCA GTACGGATAC 
TGATTCCGCA GTACGGATAC 
TGATTCCGCA GTACGGATAC 
TGATTCCTCA GTACGGGTAC 
TGATTCCTCA GTACGGGTAC 
TGATTCCTCA GTACGGGTAC 
TGATTCCTCA GTACGGGTAC 
TGATTCCTCA GTACGGGTAC 
TGATTCCGCA ATACGGCTAC 
TGGTGCCACA GTATGGATAC 
TGGTCCCTCA GTATGGATAC 
TGATTCCGCA GTACGGGTAC 
TGGTTCCTCA GTACGGGTAC 
TGATTCCTCA GTACGGGTAC 
TGATTCCTCA GTACGGGTAC 



3400 

CTGACTCTAA ACAACGGCAG 
CTGACTCTGA ACAACGGCAG 
CTGACTCTGA ACAACGGCAG 
CTGACTCTGA ACAACGGCAG 
CTGACTCTGA ACAACGGCAG 
CTGACTCTGA ACAACGGCAG 
CTGACTCTGA ACAACGGCAG 
CTGACTCTGA ACAACGGCAG 
CTGACTCTAA ACAACGGCAG 
CTGACTCTAA ACAACGGCAG 
CTGACTCTAA ACAACGGCAG 
CTGACTCTAA ACAACGGCAG 
CTGACTCTAA ACAACGGCAG 
CTGACCCTAA ACAATGGCAG 
CTGACCCTAA ACAATGGCAG 
CTGACCCTAA ACAATGGCAG 
TTAACTTTAA ACAATGGAAG 
TTAACTTTAA ACAATGGAAG 
TTAACTTTAA ACAATGGAAG 
TTAACTTTAA ACAATGGAAG 
CTGACTCTGA ACAATGGCAG 
CTGACTCTGA ACAATGGCAG 
CTGACTCTGA ACAATGGCAG 
CTGACTCTGA ACAATGGCAG 
CTGACTCTGA ACAATGGCAG 
CTGACTCTGA ACAATGGCAG 
CTGACTCTGA ACAATGGCAG 
CTGACTCTGA ACAATGGCAG 
TTGACTCTGA ACAATGGCAG 
TTGACTCTGA ACAATGGCAG 
TTGACTCTGA ACAATGGCAG 
TTGACTCTGA ACAATGGCAG 
CTGACTCTGA ACAACGGCAG 
CTGACGCTCA ACAATGGCAG 
CTCACCCTGA ACAACGGCAG 
CTCACCCTGA ACAACGGAAG 
CTAACACTCA ACAACGGTAG 
CTGACGCTCA ACAATGGAAG 
CTGACTCTGA ACAATGGCAG 
CTGACTCTGA ACAATGGCAG 
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42 2 


TCAGTCTGTG 


42^8 


TCAGGCCG'VG 


42 Ts 
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Z Z O Z 


PPa ZlTPPPTZi 


9 9*^ /t 
Z Z O 4 


/^PaZXTPPPTZi 


9 9 ^ 
Z Z O D 




Z Z O D 




223 7 


CCAATCGGTA 


A3 4 


CCAAGCGGTA 


A3 5 


CCAAGCGGTA 


A3 7 


CCAAGCGGTA 


A3 3 


CCAAGCGGTA 


42_12 


TCAGGCCGTG 


AAVl 


CCAAGCCGTG 


AAV2 


TCAGGCAGTA 


AAV3 


TCAAGCGGTG 


AAV 8 


TCAGGCCGTG 


AAV 9 


TC7VAGCGTTA 


AAV7 


TCAGTCTGTG 


44 2 


TCAGGCCGTG 



Fig. lAAQ 



GGACGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGACGTTCTT CATTCTACTG 
GGACGTTCTT CATTCTACTG 
GGACGTTCTT CATTCTACTG 
GGACGTTCTT CATTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGACGTTCAT CCTTTTACTG 
GGACGCTCTT CATTTTACTG 
GGACGCTCAT CCTTTTACTG 
GGACGCTCCT CCTTCTACTG 
GGACGTTCTT CTTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 



3450 

CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAATAC TTCCCTTCTC 
CCTGGAATAC TTCCCTTCTC 
CCTGGAATAC TTCCCTTCTC 
TCTGGAGTAT TTCCCATCGC 
TCTGGAGTAT TTCCCATCGC 
TCTGGAGTAT TTCCCATCGC 
TCTGGAGTAT TTCCCATCGC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
TCTAGAGTAT TTTCCCTCTC 
TCTAGAGTAT TTTCCCTCTC 
TCTAGAGTAT TTTCCCTCTC 
TCTAGAGTAT TTTCCCTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAATAT TTCCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTCCCTTCGC 
CCTGGAATAC TTTCCTTCGC 
TCTGGAATAC TTCCCTTCTC 
CCTGGAGTAC TTCCCCTCTC 
CCTGGAGTAC TTTCCTTCTC 
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3451 


42 2 


AGATGCTGAG 


42 8 


AAATGCTGAG 


42 15 


AAATGCGGAG 


42 5b 


AAATGCTGAG 


42 lb 


AAATGCTGAG 


42 13 


AAATGCTGAG 


42 3a 


AAATGCTGAG 


4 2_4 


AAATGCTGAG 


42 5a 


AGATGCTGAG 


42_10 


AGATGCTGAG 


42_3b 


AGATGCTGAG 


42_11 


AGATGCTGAG 


42_6b 


AGATGCTGAG 


4 3__1 


AAATGCTGAG 


4 3__5 


AAATGCTGAG 


43 12 


AAATGCTGAG 


4 3_2 0 


AGATGCTGAG 


43__21 


AGATGCTGAG 


4 3_2 3 


AGATGCCGAG 


4 3__2 5 


AGATGCTGAG 


44_1 


AAATGCTGAG 


4 4_5 


AAATGCTGAG 


223_10 


AGATGCTGAG 


223__2 


AGATGCTGAG 


223__4 


AGATGCTGAG 


22 3_5 


AGATGCTGAG 


223 6 


AGATGCTGAG 


223 7 


AGATGCTGAG 


A3 4 


AGATGCTGAG 


A3 5 


AGATGCTGAG 


A3 7 


AGATGCTGAG 


A3 3 


AGATGCTGAG 


42_12 


AAATGCTGAG 


AAVl 


AGATGCTGAG 


AAV2 


AGATGCTGCG 


AAV 3 


AGATGCTAAG 


AAV 8 


AGATGCTGAG 


AAV 9 


AGATGCTGAG 


AAV7 


AGATGCTGAG 


44 2 


AAATGCTGAG 



Fig. lAAR 



AACGGGCAAT AACTTTGAAT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAT AACTTTGAAT 
AACGGGCAAT AACTTTGAAT 
AACGGGCAAT AACTTTGAAT 
AACGGGCAAT AACTTTGAAT 
AACGGGCAAT AACTTTGAAT 
GACGGGCAAC AACTTTGAAT 
GACGGGCAAC AACTTTGAAT 
GACGGGCAAC AACTTTGAAT 
7\ACCGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTCACCT 
AACGGGCAAC AACTTCACCT 
AACGGGCAAC AACTTCACCT 
AACGGGCAAC AACTTCACCT 
AACGGGCAAC AACTTCACCT 
AACGGGCAAC AACTTCACCT 
GACGGGAAAC AACTTCACCT 
GACGGGAAAC AACTTCACCT 
GACGGGAAAC AACTTCACCT 
GACGGGAAAC AACTTCACCT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AJ\CTTTACCT 
TACCGGAAAC AACTTTACCT 
GACTGGAAAT AACTTCCAAT 
AACGGGCAAC AACTTCCAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 



3500 

TCAGCTACAC CTTTGAGGAA 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACAC CTTTGAGGAA 
TCAGCTACAC CTTTGAGGAA 
TCAGCTACAC CTTTGAGGAA 
TCAGCTACAC CTTTGAGGAA 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TCAGCTACAC TTTTGAAGAC 
TCAGCTACAC TTTTGAAGAC 
TCAGCTACAC TTTTGAAGAC 
TCAGCTACAC TTTTGAAGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACAC CTTTGAGGAA 
TCAGCTACAC TTTTGAGGAC 
TCAGCTATAC CTTCGAGGAT 
TTACTTACAC CTTCGAGGAC 
TCAGCTACAC TTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACCA GTTTGAGGAC 
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3501 


42 2 


GTGCCTTTCC 


42 8 


GTGCCTTTTC 


42 15 


GTGCCTTTTC 


42 5b 


GTGCCTTTTC 


42 lb 


GTGCCTTTTC 


42 13 


GTGCCTTTTC 


42 3a 


GTGCCTTTTC 


42 4 


GTGCCTTTTC 


42 5a 


GTGCCCTTTC 


42_10 


GTGCCTTTCC 


42_3b 


GTGCCTTTCC 


42_11 


GTGCCTTTCC 


42_6b 


GTGCCTTTCC 


43_1 


GTGCCTTTCC 


43 5 


GTGCCTTTCC 


43 12 


GTGCCTTTCC 


4 3_2 0 


GTGCCTTTCC 


43 21 


GTGCCTTTCC 


43 23 


GTGCCTTTCC 


43 25 


GTGCCTTTCC 


44 1 


GTGCCTTTTC 


44 5 


GTGCCTTTTC 


223 10 


GTGCCTTTCC 


223 2 


GTGCCTTTCC 


223 4 


GTGCCTTTCC 


223 5 


GTGCCTTTCC 


223 6 


GTGCCTTTCC 


223 7 


GTGCCTTTCC 


A3 4 


GTGCCTTTCC 


A3 5 


GTGCCTTTCC 


A3 7 


GTGCCTTTCC 


A3 3 


GTGCCTTTCC 


42__12 


GTGCCTTTTC 


AAVl 


GTGCCTTTCC 


AAV2 


GTTCCTTTCC 


AAV 3 


GTACCTTTTC 


AAV 8 


GTGCCTTTCC 


AAV 9 


GTGCCTTTCC 


AAV7 


GTGCCTTTCC 


44 2 


GTGCCTTTTC 



Fig. lAAS 



ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCATAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA TGCGCATAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCTCACAGC 
ACAGCAGCTA CGCTCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCACACAGC 
ACAGCAGCTA CGCACACAGC 
ACAGCAGCTA CGCGCACAGC 



3550 

CAGAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACAGGCTGAT 
CAGAGCCTGG ACAGGCTGAT 
CAGAGCCTGG ACAGGCTGAT 
CAGAGCCTGG ACAGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAGAGTCTGG ACCGGCTGAT 
CAGAGTCTGG ACCGGCTGAT 
CAGAGTCTGG GCCGGCTGAT 
CAGAGTCTGG GCCGGCTGAT 
CAGAGTCTGG ACCGGCTGAT 
CAGAGTCTGG ACCGGCTGAT 
CAGAGTCTGG ATCGGCTGAT 
CAGAGTCTGG ATCGGCTGAT 
CAGAGTCTGG ATCGGCTGAT 
CAGAGTCTGG ATCGGCTGAT 
CAAAGCCTGG ACCGGCTGAC 
CAGAGCCTGG ACCGGCTGAT 
CAGAGTCTGG ACCGTCTCAT 
CAGAGTTTGG ATCGCTTGAT 
CAGAGCTTGG ACCGGCTGAT 
CAGAGTCTAG ATCGACTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
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GAATCCCCTC 


ATCGACCAGT 


42 8 


GAACCCCCTC 


ATCGACCAGT 
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GAACCCCCTC 


ATCGACCAGT 
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ATCGACCAGT 
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42 10 


GAATCCCCTC 


ATCGACCAGT 
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GAATCCCCTC 


ATCGACCAGT 
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GAATCCCCTC 


ATCGACCAGT 
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GAATCCCCTC 


ATCGACCAGT 


4 3__1 


GAACCCTCTC 


ATCGACCAGT 


4 3_5 


GAACCCTCTC 


ATCGACCAGT 


43_12 


GAACCCTCTC 


ATCGACCAGT 


4 3__20 


GAATCCCCTC 


ATCGACCAGT 


4 3__21 


GAATCCCCTC 


ATCGACCAGT 


4 3__23 


GAATCCCCTC 


ATCGACCAGT 


43 25 


GAATCCCCTC 


ATCGACCAGT 


4 4 1 


GAACCCCCTC 


ATCGACCAGT 


4 4 5 


GAACCCCCTC 


ATCGACCAGT 


223 10 


GAATCCCCTC 
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o o o o 
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O O O A 
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ATCGACCAGT 


223 7 


GAATCCCCTC 


ATCGACCAGT 


A3 4 


GAATCCTCTC 


ATTGACCAGT 


A3 5 


GAATCCTCTC 


ATTGACCAGT 


A3 7 


GAATCCTCTC 


ATTGACCAGT 


A3 3 


GAATCCTCTC 


ATTGACCAGT 


42_12 


GAACCCCCTC 


ATCGACCAGT 


AAVl 


GAATCCTCTC 


ATCGACCAAT 


AAV2 


GAATCCTCTC 


ATCGACCAGT 


AAV3 


GAATCCTCTT 


ATTGATCAGT 


AAV8 


GAATCCTCTC 


ATTGACCAGT 


AAV 9 


GAACCCCCTC 


ATCGACCAGT 


AAV7 


GAATCCCCTC 


ATCGACCAGT 


44 2 


GAACCCCCTC 


ATCGACCAGT 



Fig. lAAT 

3600 

ACCTGTACTA CCTGGCCCGG ACCCAGAGCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGGCCCGG ACCCAGAGCA 
ACCTGTACTA CCTGGCCCGG ACCCAGAGCA 
ACCTGTACTA CCTGGCCCGG ACCCAGAGCA 
ACCTGTACTA CCTGGCCCGG ACCCAGAGCA 
ACCTGTATTA CTTATCCAGA ACTCAGTCCA 
ACCTGTATTA CTTATCCAGA ACTCAGTCCA 
ACCTGTATTA CTTATCCAGA ACTCAGTCCA 
ACCTGTACTA CCTGGTCAGA ACGCAAACGA 
ACCTGTACTA CCTGGTCAGA ACGCAAACGA 
ACCTGTACTA CCTGGTCAGA ACGCAAACGA 
ACCTGTACTA CCTGGTCAGA ACGCAAACGA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CTTGGCCAGA ACACAGAGCA 
ACCTGTACTA CTTGGCCAGA ACACAGAGCA 
ACCTGTACTA CTTGGCCAGA ACACAGAGCA 
ACCTGTACTA CTTGGCCAGA ACACAGAGCA 
ACCTGTACTA CTTGGCCAGA ACACAGAGCA 
ACCTGTACTA CTTGGCCAGA ACACAGAGCA 
ACCTGTATTA CCTGAGCAAA ACTCAGGGTA 
ACCTGTATTA CCTGAGCAAA ACTCAGGGTA 
ACCTGTATTA CCTGAGCAAA ACTCAGGGTA 
ACCTGTATTA CCTGAGCAAA ACTCAGGGTA 
ACCTGTACTA CCTGGCCCGG ACCCAGAGCA 
ACCTGTATTA CCTGAACAGA ACTCAAA.AT 
ACCTGTATTA CTTGAGCAGA ACAAACACTC 
ATCTGTACTA CCTGAACAGA ACGCAAGGAA 
ACCTGTACTA CTTGTCTCGG ACTCAAACAA 
ACCTATACTA CCTGGTCAGA ACACAGACAA 
ACTTGTACTA CCTGGCCAGA ACACAGAGTA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
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TCCACAAGGG AGCTGCA.GT TCCA TCAGGCTGGG 

TACCGCAGGA ACTCAGCAGT TGCTATTTTC TCAGGCCGGG 
TACCGCAGGA ACTCAGCAGT TGCTATTTTC TCAGGCCGGG 
TACCGCAGGA ACTCAGCAGT TGCTATTTTC TCAGGCCGGG 
TACCGCAGGA ACTCAGCAGT TGCTATTTTC TCAGGCCGGG 
TACCGCAGGA ACTCAGCAGT TGCTATTTTC TCAGGCCGGG 
TACCGCAGGA ACTCAGCAGT TGCTATTTTC TCAGGCCGGG 
TACCGCAGGA ACTCAGCAGT TGCTATTTTC TCAGGCCGGG 
TACCGCAGGA ACTCAGCAGT TGCTATTTTC TCAGGCCGGG 

GTCCACAAGG GAGCTGCAGT TCCA TCAGGCTGGG 

GTCCACAAGG GAGCTGCAGT TCCA TCAGGCTGGG 

GTCCACAAGG GAGCTGCAGT TCCA TCAGGCTGGG 

GTCCACAAGG GAGCTGCAGT TCCA TCAGGCTGGG 

AACTCAAGGT ACTCAGCAAT TGTTATTTTC TCAAGCCGGG 
AACTCAAGGT ACTCAGCAAT TGTTATTTTC TCAAGCCGGG 
AACTCAAGGT ACTCAGCAAT TGTTATTTTC TCAAGCCGGG 
AACTGGAGGG ACGCAGACTC TGGCATTCAG CCAAGCGGGT 
AACTGGAGGG ACGCAGACTC TGGCATTCAG CCAAGCGGGT 
AACTGGAGGG ACGCAGACTC TGGCATTCAG CCAAGCGGGT 
AACTGGAGGG ACGCAGACTC TGGCATTCAG CCAAGCGGGT 
TACCGCAGGA ACTCAGCAGT TGCTATTTTC TCAGGCCGGG 
TACCGCAGGA ACTCAGCAGT TGCTATTTTC TCAGGCCGGG 
TACTGCTGGC AATCGGGAAC TGCAGTTTTA TCAGGGCGGA 
TACTGCTGGC AATCGGGAAC TGCAGTTTTA TCAGGGCGGA 
TACTGCTGGC AATCGGGAAC TGCAGTTTTA TCAGGGCGGA 
TACTGCTGGC AATCGGGAAC TGCAGTTTTA TCAGGGCGGA 
TACTGCTGGC AATCGGGAAC TGCAGTTTTA TCAGGGCGGA 
TACTGCTGGC AATCGGGAAC TGCAGTTTTA TCAGGGCGGA 
AACAACGCAG CAATCGAGAC TGCAGTTCAG CCAAGCTGGG 
AACAACGCAG CAATCGAGAC TGCAGTTCAA CCAAGCTGGG 
AACAACGCAG CAATCGAGAC TGCAGTTCAG CCAAGCTGGG 
AACAACGCAG CAATCGAGAC TGCAGTTCAG CCAAGCTGGG 

GTCCACAAGG GGGCTGCAGT TCCA TCAGGCTGGG 

AAGTGCCCAA AACAAGGACT TGCTGTTTAG CCGTGGGTCT 
AACCACCACG CAGTCAAGGC TTCAGTTTTC TCAGGGCGGA 
AACAACCAAC CAATCACGGC TGCTTTTTAG CCAGGCTGGG 
.ACGGCAAAT ACGCAGACTC TGGGCTTCAG CCAAGCTGGG 
.ACTGGGGGA ACTCAAACTT TGGCATTCAG CC7VAGCAGGC 
CACAGCTGGC AATCGGGAAC TGCAGTTTTA CCAGGGCGGG 
TACCGCAGGA ACTCAGCAGT TGCTATTTTC TCAGGCCGGG 
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Fig . lAAV 



TGGCCGAGCA ATCAAAGAAC 
TGTCGGCTCA GGCCAAAAAC 

TGTCGGCTCA GGCCAAAAAC 
TGTCGGCTCA GGCCAAAAAC 
TGTCGGCTCA GGCCAAAAAC 
TGTCGGCTCA GGCCAAAAAC 
TGTCGGCTCA GGCCAAAAAC 
TGTCGGCTCA GGCCAAAAAC 
TGTCGGCTCA GGCCAAAAAC 
TGGCCGAGCA ATCAAAGAAC 
TGGCCGAGCA ATCAAAGAAC 
TGGCCGAGCA ATCAAAGAAC 
TGGCCGAGCA ATCAAAGAAC 
TGTCGGCTCA GGCCAAGAAC 
TGTCGGCTCA GGCCAAGAAC 
TGTCGGCTCA GGCCAAGAAC 
TGGCCAACCA GGCTAGAAAT 
TGGCCAACCA GGCTAGAAAT 
TGGCCAACCA GGCTAGAAAT 
TGGCCAACCA GGCTAGAAAT 
TGTCGGCTCA GGCCAAAAAC 
TGTCGGCTCA GGCCAAAAAC 
TGGCCGAACA AGCAAAGAAC 
TGGCCGAACA AGCAAAGAAC 
TGGCCGAACA AGCAAAGAAC 
TGGCCGAACA AGCAAAGAAC 
TGGCCGAACA AGCAAAGAAC 
TGGCCGAACA AGCAAAGAAC 
TGGCTCAGCA GGCCAAAAAC 
TGGCTCAGCA GGCCAAAAAC 
TGGCTCAGCA GGCCAAAAAC 
TGGCTCAGCA GGCCAAAAAC 
TGGCCGAGCA ATCAAAGAAC 
TGTCTGTTCA GGCCAAAAAC 
TTCGGGACCA GTCTAGGAAC 
TGTCTTTGCA GGCCAGAAAT 
TGGCCAATCA GGCAAAGAAC 
TGGCCAATCA GGCTAGAAAC 
TGGCCGAACA AGCCAAGAAT 
TGTCGGCTCA GGCCAAAAAC 



3700 

TGGCTGCCCG GACCCTGTTA 
TGGCTACCCG GGCCCTGCTA 

TGGCTACCCG GGCCCTGCTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTGCCCG GACCCTGTTA 
TGGCTGCCCG GACCCTGTTA 
TGGCTGCCCG GACCCTGTTA 
TGGCTGCCCG GACCCTGTTA 
TGGCTACCTG GACCGTGTTA 
TGGCTACCTG GACCGTGTTA 
TGGCTACCTG GACCGTGTTA 
TGGGTGCCCG GACCTTGCTA 
TGGGTGCCCG GACCTTGCTA 
TGGGTGCCCG GACCTTGCTA 
TGGGTGCCCG GACCTTGCTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTGCCCG GACCTTGCTT 
TGGCTGCCCG GACCTTGCTT 
TGGCTGCCCG GACCTTGCTT 
TGGCTGCCCG GACCTTGCTT 
TGGCTGCCCG GACCTTGCTT 
TGGCTGCCCG GACCTTGCTT 
TGGCTACCCG GACCCAGCTA 
TGGCTACCCG GACCCAGCTA 
TGGCTACCGG GACCCAGCTA 
TGGCTACCGG GACCCAGCTA 
TGGCTGCCCG GACCCTGTTA 
TGGCTACCTG GACCCTGTTA 
TGGCTTCCTG GACCCTGTTA 
TGGCTACCTG GGCCCTGCTA 
TGGCTGCCAG GACCCTGTTA 
TGGCTACCCG GGCCTTGCTA 
TGGTTACCTG GACCTTGCTT 
TGGCTACCCG GGCCCTGCTA 
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Fig, lAAW 



AGACTGTCAA AAAACATAGA 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACAGTGTC 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACACTGTC 
AGACTGTCAA AAAACATAGA 
AGACTGTCAA AAAACATAGA 
AGACTGTCAA AAGACATAGA 
AGACTGTCi^A AAAACATAGA 
CGAGTTTCCA CGACACTGTC 
CGAGTTTCCA CGACACTGTC 
CGAGTTTCCA CGACACTGTC 
CGCGTCTCCA CGACAACCAA 
CGCGTCTCCA CGACAACCAA 
CGCGTCTCCA CGACAACCAA 
CGCGTCTCCA CGACAACCAA 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACACTGTC 
AGAGTATCCA AGACGCTGGA 
AGAGTATCCA AGACGCTGGA 
AGAGTATCCA AGACGCTGGA 
AGAGTATCCA AGACGCTGGA 
AGAGTATCCA AGACGCTGGA 
AGAGTATCCA AGACGCTGGA 
CGAATGTCTA AGACGGCTAA 
CGAATGTCTA AGACGGCTAA 
CGAATGTCTA AGACGGCTAA 
CG7VATGTCTA AGACGGCTAA 
AGACTGTCAA AAAACATAGA 
CGCGTTTCTA AAACAAAAAC 
CGAGTATCAA AGACATCTGC 
AGACTTTCAA AGACTGCTAA 
CGCGTCTCAA CGACAACCGG 
CGCGTCTCCA CAACCACCAA 
AGAGTCTCCA AAACGCTGGA 
CGCGTCTCCA CGACACTGTC 



3750 

CAGCAACAAC AACAGTAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
CAGCAACAAC AACAGTAACT 
CAGCAACAAC ACCAGTAACT 
CAGCAACAAC AACAGTAACT 
CAGCAACAAC AACAGTAACT 
GCAAAACAAC AACAGCAATT 
GCAAAACAAC AACAGCAATT 
GCAAAACAAC AACAGCAATT 
CCAGAACAAC AACAGCAACT 
CCAGAGCAAC AACAGCAACT 
CCAGAACAAC AACAGCAACT 
CCAGAACAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
TCAAAATAAC AACAGCAACT 
TCAAAATAAC AACAGCAACT 
TCAAAATAAC AACAGCMCT 
TCAAAATAAC AACAGCAACT 
TCAAAATAAC AACAGCAACT 
TCAAAATAAC AACAGCAACT 
TGACAACAAC AACAGTG/^JVT 
TGACAACAAC AACAGTGAAT 
TGACAACAAC AACAGTGAAT 
TGACAACAAC AACAGTGAAT 
CAGCAACAAC AACAGTAACT 
AGACAACAAC AACAGCAATT 
GGATAACAAC AACAGTGAAT 
CGACAACAAC AACAGTAACT 
GCAAAACAAC AATAGCAACT 
GCAAAATAAC AACAGCAACT 
TCAAAACAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
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Fig, lAAX 



CGGGGCCACT AAATACCATC 
CGGTGCCACC AAGTATCATC 
CGGTGCCACC AAGTATCATC 
CGGTGCCACC AAGTATCATC 
CGGTGCCACC AAGTATCATC 
CGGTGCCACC AAGTATCATC 
CGGTGCCACC AAGTATCATC 
CGGTGCCACC AAGTATCATC 
CGGTGCCACC AAGTATCATC 
CGGGGCCACT AAATACCATC 
CGGGGCCACT AAATACCATC 
CGGGGCCACT A7\ATACCATC 
CGGGGCCACT AAATACCATC 
CGGTGCCACC AAGTATCACC 
CGGTGCCACC AAGTATCACC 
CGGTGCCACC AAGTATCACC 
GGGAGCTGCC AAGTTTAAGC 
GGGAGCTGCC AAGTTTAAGC 
GGGAGCTGCC AAGTTTAAGC 
GGGAGCTGCC AAGTTTAAGC 
CGGTGCCACC AAGTATCATC 
CGGTGCCACC AAGTATCATC 
TGGTGCCACA AAATACCATT 
TGGTGCCACA AAATACCATT 
TGGTGCCACA AAATACCATT 
TGGTGCCACA AAATACCATT 
TGGTGCCACA AAATACCATT 
TGGTGCCACA AAATACCATT 
TGCAGCCACC AAATATTACC 
TGCAGCCACC AAATATTACC 
TGCAGCCACC AAATATTACC 
TGCAGCCACC AAATATTACC 
CGGGGCCACT AAATACCATC 
TGGTGCTTCA AAATATAACC 
TGGAGCTACC AAGTACCACC 
AGCGGCCAGC AAATATCATC 
TGCTGGGACC AAATACCATC 
GGGAGCTGCT AAATTCAAGC 
TGGTGCCACC AAATATCACC 
CGGTGCCACC AAGTATCATC 



3800 

TGAATGGTAG AAATTCATTA 
TGAATGGCAG AGACTCTCTG 
TGAATGGCAG AGACTCTCTG 
TGAATGGCAG AGACTCTCTG 
TGAATGGCAG AGACTCTCTG 
TGAATGGCAG AGACTCTCTG 
TGAATGGCAG AGACTCTCTG 
TGAATGGCAG AGACTCTCTG 
TGAATGGCAG AGACTCTCTG 
TGAATGGTAG AAATTCATTA 
TGAATGGTAG AAATTCATTA 
TGAATGGTAG AAATTCATTA 
TGAATGGTAG AAATTCATTA 
TGAATGGCAG AGACTCCCTG 
TGAATGGCAG AGACTCCCTG 
TGAATGGCAG AGACTCCCTG 
TGAACGGCCG AGACTCTCTA 
TGAACGGCCG AGACTCTCTA 
TGAACGGCCG AGACTCTCTA 
TGAACGGCCG AGACTCTCTA 
TGAATGGCAG AGACTCTCTG 
TGAATGGCAG AGACTCTCTG 
TAAATGNAAG AAATTCATTG 
TAAATGGAAG AAATTCATTG 
TAAATGGAAG AAATTCATTG 
TAAATGGAAG AAATTCATTG 
TAAATGGAAG AAATTCATTG 
TAAATGGAAG AAATTCATTG 
TGAATGGAAG AAATTCTCTG 
CGAATGGAAG AAATTCTCTG 
TGAATGGAAG AAATTCTCTG 
TG7VATGGAAG AAATTCTCTG 
TGAATGGTAG AAATTCATT/-\ 
TCAATGGGCG TGAATCCATC 
TGAATGGCAG AGACTCTCTG 
TCAATGGGCG CGACTCGCTG 
TGAATGGAAG AAATTCATTG 
TGAACGGGAG AGACTCGCTA 
TGAACGGCAG AAACTCGTTG 
TGAATGGCAG AGACTCTCTG 
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Fig. lAAY 



GCGTAGCCAT GGCCACCAAC 
GTGTCGCTAT GGCAACGCAC 
GTGTCGCTAT GGCAACGCAC 
GTGTCGCTAT GGCAACGCAC 
GTGTCGCTAT GGCAACGCAC 
GTGTCGCTAT GGCAACGCAC 
GTGTCGCTAT GGCAACGCAC 
GTGTCGCTAT GGCAACGCAC 
GTGTCGCTAT GGCAACGCAC 
GCGTAGCCAT GGCCACCAAC 
GCGTAGCCAT GGCCACCAAC 
GCGTAGCCAT GGCCACCAAC 
GCGTAGCCAT GGCCACCAAC 
GCGTTGCCAT GGCTACCCAC 
GCGTTGCCAT GGCTACCCAC 
GCGTTGCCAT GGCTACCCAC 
GCGTGGCAAT GGCTTCCCAC 
GCGTGGCAAT GGCTTCCCAC 
GCGTGGCAAT GGCTTCCCAC 
GCGTGGCAAT GGCTTCCCAC 
GTGTCGCTAT GGCAACGCAC 
GTGTCGCTAT GGCAACGCAC 
GTGTCGCCAT GGCAACGCAC 
GTGTCGCCAT GGCAACGCAC 
GTGTCGCCAT GGCAACGCAC 
GTGTCGCCAT GGCAACGCAC 
GTGTCGCCAT GGCAACGCAC 
GTGTCGCCAT GGCAACGCAC 
GGCCCCCAAT GGCCAGTCAC 
GGCCCCCAAT GGCCAGTCAC 
GGCCCCCAAT GGCCAGTCAC 
GGCCCCCAGT GGCCAGTCAC 
GCGTAGCCAT GGCCACCAAC 
GCACTGCTAT GGCCTCACAC 

GGCC AT GGCAAGCCAC 

GACCAGCTAT GGCCAGTCAC 
GCATCGCTAT GGCAACACAC 
GCGTGGCTAT GGCATCGCAC 
GCGTCGCCAT GGCAACTCAC 
GTGTCGCTAT GGCAACGCAC 
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AAGGACGACG AGGACCAGTT 
AAGGACGACG AAGAGCGATT 
AAGGACGACG AAGAGCGATT 
AAGGACGACG AAGAGCGATT 
AAGGGCGACG AAGAGCGATT 
AAGGGCGACG AAGAGCGATT 
AAGGACGACG AAGAGCGATT 
AAGGACGACG AAGAGCGATT 
AAGGACGACG AAGAGCGATT 
7\AGGACGACG AGGACCAGTT 
AAGGACGACG AGGACCAGTT 
AAGGACGACG AGGACCAGTT 
AAGGACGACG AGGACCAGTT 
AAGGACGACG AGGAGCGCTT 
AAGGACGACG AGGAGCGCTT 
AAGGACGACG AGGAGCGCTT 
AAGGATGACG AGGAGCGCTT 
AAGGATGACG AGGAGCGCTT 
AAGGATGACG AGGAGCGCTT 
AAGGATGACG AGGAGCGCTT 
AAGGACGACG AAGAGCGATT 
AAGGACGACG AAGAGCGATT 
AAGGACGACG AGGAACGCTT 
AAGGACGACG AGGAACGCTT 
AAGGACGACG AGGAACGCTT 
AAGGACGACG AGGAACGCTT 
AAGGACGACG AGGAACGCTT 
AAGGACGACG AGGAACGCTT 
AAGGACGATG AGGAAAAGTA 
AAGGACGATG AGGAAAAGTA 
AAGGACGATG AGG7\AAAGTA 
AAGGACGATG AGGAAAAGTA 
AAGGACGACG AGGACCAGTT 
AAAGACGACG AAGACAAGTT 
AAGGACGATG AAGAAAAGTT 
AAGGACGATG AAGAAAAATT 
AAAGACGACG AGGAGCGTTT 
AAAGACGACG AGGAGCGCTT 
AAGGACGACG AGGAGCGCTT 
AAGGACGACG AAGAGCGATT 
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Fig. lAAZ 



AACGGAGTGC TGGTTTTTGG 
AGCGGAGTCT TGATGTTTGG 
AGCGGAGTCT TGATGTTTGG 
AGCGGAGTCT TGATGTTTGG 
AGCGGAGTCT TGATGTTTGG 
AGCGGAGTCT TGATGTTTGG 
AGCGGAGTCT TGATGTTTGG 
AGCGGAGTCT TGATGTTTGG 
AGCGGAGTCT TGATGTTTGG 
AACGGAGTGC TGGTTTTTGG 
AACGGAGTGC TGGTTTTTGG 
AACGGAGTGC TGGTTTTTGG 
AACGGAGTGC TGGTTTTTGG 
AGCGGAGTTC TAATGTTTGG 
AGCGGAGTTC TAATGTTTGG 
AGCGGAGTTC TAATGTTTGG 
AGCGGGGTCC TGATTTTTGG 
AGCGGGGTCC TGATTTTTGG 
AGCGGGGTCC TGATTTTTGG 
AGCGGGGTCC TGATTTTTGG 
AGCGGAGTCT TAATGTTTGG 
AGCGGAGTCT TAATGTTTGG 
AGCGGAGTTC TAATTTTTGG 
AGCGGAGTTC TAATTTTTGG 
AGCGGAGTTC TAATTTTTGG 
AGCGGAGTTC TAATTTTTGG 
AGCGGAGTTC TAATTTTTGG 
AGCGGAGTTC TAATTTTTGG 
CACGGAAATC TGATGTTTGG 
CACGGAAATC TGATGTTTGG 
CACGGAAATC TGATGTTTGG 
CACGGAAATC TGATGTTTGG 
AACGGAGTGC TGGTTTTTGG 
AGCGGTGTCA TGATTTTTGG 
AGCGGGGTTC TGATGTTTGG 
CACGGCAATC TAATATTTGG 
AACGGGATCC TGATTTTTGG 
AGTGGCGTTC TCATATTTGG 
AGCGGAGTCC TGATTTTTGG 
AGCGGAGTCT TAATGTTTGG 



3900 

CGAAACGGGG GCTGCCAACA 

GAAACAGGGA GCTGGAAA. . 

GAAACAGGGA GCTGGAAA, . 

GAAACAGGGA GCTGGAAA.. 

GAAACAGGGA GCTGGA7\A . . 

GAAACAGGGA GCTGGAAA.. 

GAAACAGGGA GCTGGAAA . . 

GAAACAGGGA GCTGGAAA . . 

GAAACAGGGA GCTGGAAA . . 

CAAAACGGGG GCTGCCAACA 

CAAAACGGGG GCTGCCAACA 

CAAAACGGGG GCTGCCAACA 

CAAAACGGGG GCTGCCAACA 

CAAGCAGGGG GCTGGAAA. . 

CAAGCAGGGG GCTGGZ\AA. . 

CAAGCAGGGG GCTGGAAA. . 

CAAGCAAGGA GCCGGGAA. . 
CAAGCAAGGA GCCGGGAA. . 
CAAGCAAGGA GCCGGGAA, . 

CAAGCAAGGA GCCGGGAA. . 

GAAACAGGGA GCTGGAAA. . 

GAAACAGGGA GCTGGAAA. , 

CAAAACTGGA GCAGCTAATA 

CAAAACTGGA GCAGCTAATA 

CAAAACTGGA GCAGCTAATA 

CAAAACTGGA GCAGCTAATA 

CAAAACTGGA GCAGCTAATA 

CAAAACTGGA GCAGCTAATA 

AAAACAAGGC ACAGGAAC . . 

AAAACAAGGC ACAGGAAC . . 

AAAACAAGGC ACAGGAAC . . 

AAAACAAGGC ACAGGAAC . . 

CAAAACGGGG GCTGCCAACA 

AAAAGAGAGC GCCGGAGC . . 

GAAGCT^GGC TCAGAGAA. . 

CAAAGAAGGG ACAACGGC . . 

CAAACAAAAT GCTGCCAG . . 
CAAGCAAGGA GCCGGGAA. . 
AAAAACTGGA GCAACTAACA 
GAAACAGGGA GCTGGAAA. . 
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Fig. lAAAA 

3950 

AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AGCGTGATGC TGACCAGCGA AGAAGAAATT 
AGCGTGATGC TGACCAGCGA AGAAGAAATT 
AGCGTGATGC TGACCAGCGA AGAAGAAATT 
CAAGTGCTGA TTACAGATGA GGAAGAAATC 
CAAGTGCTGA TTACAGATGA GGAAGAAATC 
CAAGTGCTGA TTACAGATGA GG7v?\GAAATC 
CAAGTGCTGA TTACAGATGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATT 
AGCGTTATGC TAACCAGTGA GGAAGAAATT 
AACGTGCTCA TGACAAATGA AGT^GAAATT 
AACGTGCTCA TGACAAATGA AGAAGAAATT 
AACGTGCTCA TGACAAATGA AGAAGAAATT 
AACGTGCTCA TGACAAATGA AGAAGAAATT 
AACGTGCTCA TGACAAATGA AGAAGAAATT 
AACGTGCTCA TGACAAATGA AGAAGAAATT 
TCAGTGCTTA TTACAGACGA AGAAGAAATC 
TCAGTGCTTA TTACAGACGA AGAAGAAATC 
TCAGTGCTTA TTACAGACGA AGAAGAAATC 
TCAGTGCTTA TTACAGACGA AGAAGAAATC 
AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AATGTCATGA TTACAGACGA AGAGGAAATT 
AAGGTCATGA TTACAGACGA AGAGGAAATC 
AATGTAATGA TTACGGATGA AGAAGAGATT 
GATGTCATGC TGACCAGCGA GGAAGAAATC 
CAGGTGCTGA TTACAGATGA GGAAGAAATT 
AATGTGTTAA TGACAAATGA AGAAGA/vATT 
AGCGTTATGC TAACCAGTGA GGAAGAAATT 
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Fig . lAAAB 



ATCCCGTGGC TACAGAAGAA 
ACCCAGTGGC CACAGAACAG 
ACCCAGTGGC CACAGAACAG 
ACCCAGTGGC CACAGAACAG 
ACCCAGTGGC CACAGAACAG 
ACCCAGTGGC CACAGAACAG 
ACCCAGTGGC CACAGAACAG 
ACCCAGTGGC CACAGAACAG 
ACCCAGTGGC CACAGAACAG 
ATCCCGTGGC TACAGAAGAA 
ATCCCGTGGC TACAGAACAG 
ATCCCGTGGC TACAGAAGAA 
ATCCCGTGGC TACAGAAGAA 
ACCCAGTGGC TACAGAGCAG 
ACCCAGTGGC TACAGAGCAG 
ACCCAGTGGC TACAGAGCAG 
ACCCCGTGGC CACAGAAGAA 
ACCCCGTGGC CACAGAAGAA 
ACCCCGTGGC CACAGAAGAA 
ACCCCGTGGC CACAGAAGAA 
ACCCAGTGGC CACGGAACAG 
ACCCAGTGGC CACAGAACAG 
ACCCGGTAGC TACCGAGGAA 
ACCCGGTAGC TACCGAGGAA 
ACCCGGTAGC TACCGAGGAA 
ACCCGGTAGC TACCGAGGAA 
ACCCGGTAGC TACCGAGGAA 
ACCCGGTAGC TACCGAGGAA 
ATCCTGTGGC TACAGAAGAA 
ATCCTGTGGC TACAGAAGAA 
ATCCTGTGGC TACAGAAGAA 
ATCCTGTGGC TACAGZ\ACAA 
ATCCCGTGGC TACAGAAGAA 
ACCCTGTGGC CACCGAAAGA 
ATCCCGTGGC TACGGAGCAG 
ATCCTGTGGC AACAGAGCAG 
ACCCTGTGGC TACAGAGGAA 
ACCCTGTAGC CACAGAGGAA 
ATCCTGTAGC CACGGAAGAA 
ACCCAGTGGC CACAGAACAG 



4000 

TACGGTGTGG TCTCCAGCAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGTGTGG TCTCCAGCAA 
TACGGTGTGG TCTCCAGCAA 
TACGGTGTGG TCTCCAGCAA 
TACGGTGTGG TCTCCAGCAA 
TATGGTGTGG TGGCAGACAA 
TATGGTGTGG TGGCAGACAA 
TATGGTGTGG TGGCAGACAA 
TATGGAGCAG TGGCCATCAA 
TATGGAGCAG TGGCCATCAA 
TATGGAGCAG TGGCCATCAA 
TATGGAGCAG TGGCCATCAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGGATTG TAAGCAGCAA 
TACGGGATTG TAAGCAGCAA 
TACGGGATTG TAAGCAGCAA 
TACGGGATTG TAAGCAGCAA 
TACGGGATTG TAAGCAGCAA 
TACGGGATTG TAAGCAGCAA 
TACGGACAGG TTGCCACCAA 
TACGGACAGG TTGCCACCAA 
TACGGACAGG TTGCCACCAA 
TACGGACAGG TTGCCACCAA 
TACGGTGTGG TCTCCAGCAA 
TTTGGGACCG TGGCAGTCAA 
TATGGTTCTG TATCTACCAA 
TATGGAACTG TGGCAAATAA 
TACGGTATCG TGGCAGATAA 
TACGGAGCAG TGGCCATCAA 
TACGGGATAG TCAGCAGCAA 
TACGGCGTGG TGGCCGAT7VA 
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Fig. lAAAC 



TCTACGGCCG GACCCCAGAC 
CAAAACGCCG CTCCTATTGT 
CAAAACGCCG CTCCTATTGT 
CAAAACGCCG CTCCTATTGT 
CAAAACGCCG CTCCTATTGT 
CAAAACGCCG CTCCTATTGT 
CAAAACGCCG CTCCTATTGT 
CAAAACGCCG CTCCTATTGT 
CAAAACGCCG CTCCTATTGT 
TCTACGGCCG GACCCCAGAC 
TCTACGGCCG GACCCCAGAC 
TCTACGGCCG GACCCCAGAC 
TCTACGGCCG GACCCCAGAC 
ACCAACGGAG CTCCCATTGT 
ACCAACGGAG CTCCCATTGT 
ACCAACGGAG CTCCCATTGT 
GCCAATACGC AGGCGCAGAC 
GCCTyVTACGC AGGCGCAGAC 
GCCAATACGC AGGCGCAGAC 
GCCAATACGC AGGCGCAGAC 
CAAAACGCCG CTCCTATTGT 
CAAAACGCCG CTCCTATTGT 
GCTAGCACCG CAGCCCAGAC 
GCTAGCACCG CAGCCCAGAC 
GCTAGCACCG CAGCCCAGAC 
GCTAGCACCG CAGCCCAGAC 
GCTAGCACCG CAGCCCAGAC 
GCTAGCACCG CAGCCCAGAC 
CAGGACACCA CAGCTTCCTA 
CAGAACACCA CAGCTTCCTA 
CAGAACACCA CAGCTTCCTA 
CAGAACACCA CAGCTTCCTA 
TCTACGGCCG GACCCCAGAC 
AGCAGCACAG ACCCTGCGAC 
GGCAACAGAC AAGCAGCTAC 
TCAAATACAG CTCCCACGAC 
CAAAACACGG CTCCTC7VAAT 
GCTAACACGC AGGCGCAAAC 
GCTAATACTG CAGCCCAGAC 
CAAAACGCCG CTCCTATTGT 



4050 

ACAGACTGTC AACAGCCAGG 
AGGGGCCGTC AACAGTCAAG 

AGGGGCCGTC AACAGTCAAG 
AGGGGCCGTC AACAGTCAAG 
AGGGGCCGTC AACAGTCAAG 
AGGGGCCGTC AACAGTCAAG 
AGGGGCCGTC AACAGTCAAG 
AGGGGCCGTC AACAGTCAAG 
AGGGGCCGTC AACAGTCAAG 
ACAGACTGTC AACAGCCAGG 
ACAGACTGTC AACAGCCAGG 
ACAGACTGTC AACAGCCAGG 
ACAGACTGTC AACAGCCAGG 
GGGAACTGTC AACAGCCAGG 
GGGAACTGTC AACAGCCAGG 
GGGAACTGTC AACAGCCAGG 
CGGACTCGTG CACAACCAGG 
CGGACTCGTG CACAACCAGG 
CGGACTCGTG CACAACCAGG 
CGGACTCGTG CACAACCAGG 
AGGGGCCGTC AACAGTCAAG 
AGGGGCCGTC AACAGTCAAG 
ACAAGTTGTT AACAACCAGG 
ACAAGTTGTT AACAACCAGG 
ACAAGTTGTT AACAACCAGG 
ACAAGTTGTT AACAACCAGG 
ACAAGTTGTT AACAACCAGG 
ACAAGTTGTT AACAACCAGG 
TGG7y\GTGTG GACAGCCAGG 
TGGAAGTGTG GACAGCCAGG 
TGGAAGTGTG GACAGCCAGG 
TGGAAGTGTG GACAGCCAGG 
ACAGACTGTC Z^ACAGCCAGG 
CGGAGATGTG CATGCTATGG 
CGCAGATGTC AACACACAAG 
TGGAAGTGTG AATCATCAGG 
TGGAACTGTC AACAGCCAGG 
TGGACTTGTG CATAACCAGG 
ACAAGTTGTC AACAACCAGG 
AGGGGCCGTC AACAGTCAAG 
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Fig. lAAAD 



CGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAG7V\CC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGCC TGGCAGAACC 
CGGCATGGTC TGGCAGAACC 
CGGCATGGTC TGGCAGAACC 
CGGCATGGTC TGGCAGAACC 
CGGCATGGTC TGGCAGAACC 
TGGTATGGTC TGGCAAAACC 
TGGTATGGTC TGGCAAAACC 
TGGTATGGTC TGGCAAAACC 
CGGCATGGTC TGGCAGAATA 
CGGCATGGTC TGGCAGAATA 
CGGCATGGTC TGGCAGAATA 
CGGCATGGTC TGGCAGAATA 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGAATGGTG TGGCAGGACC 
TGGAATGGTG TGGCAGGACC 
TGGAATGGTG TGGCAGGACC 
TGGAATGGTG TGGCAGGACC 
CGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAAGATA 
AGGCATGGTC TGGCAGGACA 
TGGCATGGTG TGGCZy\GATC 
CGGTATGGTC TGGCAGAACC 
TGGTATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 



4100 

GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGC 
GGGACGTGTA CCTGCAGGGC 
GGGACGTGTA CCTGCAGGGC 
GAGACGTGTA CCTGCAGGGT 
GAGACGTGTA CCTGCAGGGT 
GAGACGTGTA CCTGCAGGGT 
GAGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAAGGT 
GGGACGTGTA CCTGCAAGGT 
GGGACGTGTA CCTGCAAGGT 
GGGACGTGTA CCTGCAAGGT 
GGGACGTGTA CCTGCAAGGT 
GGGACGTGTA CCTGCAAGGT 
GCGATGTCTA TCTTCAAGGT 
GCGATGTCTA TCTTCAAGGT 
GCGATGTCTA TCTTCAAGGT 
GCGATGTCTA TCTTCAAGGT 
GGGACGTGTA CCTGCAGGGT 
GAGACGTGTA CCTGCAGGGT 
GAGATGTGTA CCTTCAGGGG 
GTGACGTGTA CCTTCAAGGA 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGC 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
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PPP A APIA TTP 


2z J / 


ccc 


.ATTTGG 


PPP A API AT TP 


A3 4 


ccc 


.ATTTGG 


GCCAAAACTC 


A3 5 


ccc 


.ATTTGG 


GCCAAAACTC 


A3 7 


ccc 


.ATTTGG 


GCCAAAACTC 


A3 3 


ccc 


.ATTTGG 


GCCAAAACTC 


42_12 


ccc 


.ATCTGG 


GCCAAAATTC 


AAVl 


ccc 


.ATTTGG 


GCCAAAATTC 


AAV2 


ccc 


.ATCTGG 


GCAAAGATTC 


AAV 3 


CCT 


.ATCTGG 


GCAAAGATTC 


AAV 8 


ccc 


.ATCTGG 


GCCAAGATTC 


AAV 9 


CCCTATTTGG 


GCTAAAATAC 


AAV7 


ccc 


.ATCTGG 


GCC7\AGATTC 


44 2 


CCT 


.ATCTGG 


GCCAAGATTC 



Fig. lAAAE 

4150 

CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCZy\CTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGAAACTTT CATCCCTCGC 
CTCACACGGA CGGAAACTTT CATCCCTCGC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGACACTTT CATCCTTCTC 
CTCACACGGA CGGACACTTT CATCCTTCTC 
CTCACACGGA CGGACACTTT CATCCTTCTC 
CTCACACGGA CGGACACTTT CATCCTTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACAGA TGGACACTTT CACCCGTCTC 
CACACACGGA CGGACATTTT CACCCGTCTC 
CTCACACGGA TGGACACTTT CATCCTTCTC 
CTCACACGGA CGGCAACTTC CACCCGTCTC 
CTCACACAGA TGGCAACTTT CACCCGTCTC 
CTCACACGGA TGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGAAACTTT CATCCCTCGC 
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4151 


42_2 


CCCTGATGGG 


42_8 


CGCTGATGGG 


42__15 


CGCTGATGGG 


42_5b 


CGCTGATGGG 


42 lb 


CGCTGATGGG 


42 13 


CGCTGATGGG 


4 2_3a 


CGCTGAI GGG 


42_4 


CGCTGAi Gbb 


42_5a 


CGCTGATGGG 


42__10 


CCCTGATGGG 


42_3b 


CCCTGATGGG 


42_11 


CCCTGATGGG 


42_6b 


CCCTGATGGA 


4 3_1 


CGCTGATGGG 


4 3_5 


CGCTGATGGG 


43 12 


CGCTGATGGG 


43 20 


CCCTGATGGG 


43 21 


CCCTGATGGG 


43 23 


CCCTGATGGG 


43 25 


CCCTGATGGG 


44 1 


CGCTGATGGG 


44 5 


CGCTGATGGG 


223 10 


CTCTAATGGG 


223 2 


CTCTAATGGG 


223 4 


CTCTAATGGG 


223 5 


CTCTAATGGG 


223 6 


CTCTAATGGG 


223 7 


CTCTAATGGG 


A3 4 


CGCTGATGGG 


A3 5 


CGCTCATGGG 


A3 7 


CGCTGATGGG 


A3 3 


CGCTCATGGG 


42__12 


CCCTGATGGG 


AAVl 


CTCTTATGGG 


AAV2 


CCCTGATGGG 


AAV 3 


CTCTGATGGG 


AAV 8 


CGCTGATGGG 


AAV 9 


CTCTGATGGG 


AAV7 


CTTTGATGGG 


44 2 


CGCTGATGGG 



Fig. lAAAF 



CGGATTTGGA CTCAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
CGGATTTGGA CTCAAACACC 
CGGATTTGGA CTCAAACACC 
CGGATTTGGA CTCAAACACC 
CGGATTTGGA CTCAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
CGGCTTTGGA CTGAAGCACC 
CGGCTTTGGA CTGAAGCACC 
CGGCTTTGGA CTGAAGCACC 
CGGCTTTGGA CTGAAGCACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
TGGCTTTGGA CTG7\AACACC 
TGGCTTTGGA CTGAAACACC 
TGGCTTTGGA CTGAAACACC 
TGGCTTTGGA CTGAAACACC 
TGGCTTTGGA CTGAAACACC 
TGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
CGGATTTGGA CTCAAACACC 
CGGCTTTGGA CTCAAGAACC 
TGGATTCGGA CTTZ^VACACC 
AGGCTTTGGA CTGAAACATC 
CGGCTTTGGC CTGAAACATC 
TGGATTTGGA CTGAAACACC 
CGGCTTTGGA CTTAAACATC 
AGGCTTTGGA CTGAAACACC 



4200 

CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA GATCCTGGTG 
CGCCTCCTCA GATCCTGGTG 
CGCCTCCTCA GATCCTGGTG 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCCCA GATCCTGATC 
CGCCTCCCCA GATCCTGATC 
CGCCTCCCCA GATCCTGATC 
CGCCTCCCCA GATCCTGATC 
CGCCTCCCCA GATCCTGATC 
CGCCTCCCCA GATCCTGATC 
CTCCTCCCCA GATCCTGATC 
CTCCTCCCCA GATCCTGATC 
CTCCTCCCCA GATCCTGATC 
CTCCTCCCCA GATCCTGATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA GATCCTGATC 
CTCCTCCACA GATTCTCATC 
CGCCTCCTCA AATCATGATC 
CTCCGCCTCA GATCCTGATC 
CACCTCCACA GATTCTAATT 
CGCCTCCTCA GATCCTGATC 
CGCCTCCTCA GATCCTGATT 
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Fig. IhAAG 



4 2_2 
42_8 
42_15 
4 2_5b 
42_lb 
42_13 
42_3a 
42_4 
42_5a 
42_10 
4 2_3b 
42_11 
42_6b 
43_1 
4 3_5 
4 3_12 
4 3_20 
43_21 
43_23 
43__25 
44_1 
4 4_5 
223__10 
2 23_2 
223_4 
22 3_5 
223_6 
223_7 
A3__4 
A3_5 
A3_7 
A3_3 
42_12 
AAVl 
AAV2 
AAV3 
AAV 8 
AAV 9 
AAV7 
44 2 



4201 

AAAAACACCC 
AAGAATACAC 
AAGAATACAC 
AAGAATACAC 
AAGAATACAC 
AAGAATACAC 
AAGAATACAC 
AAGAATACAC 
AAGAATACAC 
AAAAACACCC 
AAAAACACCC 
AAAAACACCC 
AAAAACACCC 
AAAAACACTC 
AAAAACACTC 
AAAAACACTC 
AAGAACACAC 
AAGAACACAC 
AAGAACACAC 
AAGAACACAC 
AAGAATACAC 
AAGAATACAC 
AAAAACACAC 
AAAAACACCC 
AAAAACACAC 
AAAAACACAC 
AAAAACACAC 
AAAAACACAC 
AAAAACACAC 
AAAAACACAC 
AAAAACACAC 
AAAAACACAC 

A, . .A 

AAAAACACCC 
AAGAACACCC 
AAAAATACTC 
AAGAACACCC 
AAAAATACAC 
AAGAACACTC 
AAGAATACAC 



CGGTACCTGC 
CTGTTCCCGC 
CTGTTCCCGC 
CTGTTCCCGC 
CTGTTCCCGC 
CTGTTCCCGC 
CTGTTCCCGC 
CTGTTCCCGC 
CTGTTCCCGC 
CGGTACCTGC 
CGGTACCTGC 
CGGTACCTGC 
CGGTACCTGC 
CTGTTCCTGC 
CTGTTCCTGC 
CTGTTCCTGC 
CGGTTCCAGC 
CGGTTCCAGC 
CGGTTCCAGC 
CGGTTCCAGC 
CTGTTCCCGC 
CTGTTCCCGC 
CGGTACCTGC 
CGGTACCTGC 
CGGTACCTGC 
CGGTACCTGC 
CGGTACCTGC 
CGGTACCTGC 
CTGTGCCAGC 
CTGTGCCAGC 
CTGTGCCAGC 
CTGTGCCAGC 



TAATCCTCCA 
GGATCCTCCA 
GGATCCTCCA 
GGATCCTCCA 
GGATCCTCCA 
GGATCCTCCA 
GGATCCTCCA 
GGATCCTCCA 
GGATCCTCCA 
TAATCCTCCA 
TAATCCTCCA 
TAATCCTCCA 
TAATCCTCCA 
GGATCCTCCG 
GGATCCTCCG 
GGATCCTCCG 
GGACCCGCCG 
GGACCCGCCG 
GGACCCGCCG 
GGACCCGCCG 
GGATCCTCCA 
GGATCCTCCA 
TAATCCTCCA 
TAATCCTCCA 
TAATCCTCCA 
TAATCCTCCA 
TAATCCTCCA 
TAATCCTCCA 
GAATCCCGCG 
GAATCCCGCG 
GAATCCCGCG 
GAATCCCGCG 



GAGGTGTTTA 
ACTACCTTCA 
ACTACCTTCA 
ACTACCTTCA 
ACTACCTTCA 
ACTACCTTCA 
ACTACCTTCA 
ACTACCTTCA 
ACTACCTTCA 
GAGGTGTTTA 
GAGGTGTTTA 
GAGGTGTTTA 
GAGGTGTTTA 
ACCACCTTCA 
ACCACCTTCA 
ACCACCTTCA 
CTTACCTTCA 
CTTACCTTCA 
CTTACCTTCA 
CTTACCTTCA 
ACTACCTTCA 
ACTACCTTCA 
GAAGTGTTTA 
GZ\AGTGTTTA 
GAAGTGTTTA 
GAAGTGTTTA 
GAAGTGTTTA 
GAAGTGTTTA 
ACCACTTTCA 
ACCACTTTCA 
ACCACTTTCA 
ACCACTTTCA 



4250 
CTCCTGCCAA 
GTCAAGCCAA 
GTCAAGCCAA 
GTCAAGCCAA 
GTCAAGCCAA 
GTCAAGCCAA 
GTCTVAGCCAA 
GTCAAGCCAA 
GTCAAGCCAA 
CTCCTGCCAA 
CTCCTGCCAA 
CTCCTGCCAA 
CTCCTGCCAA 
GCCAGGCCAA 
GCCAGGCCAA 
GCCAGGCCAA 
ACCAGGCCAA 
ACCAGGCCAA 
ACCAGGCCAA 
ACCAGGCCAA 
GTCAAGCTAA 
GTCAAGCTAA 
CTCCTGCCAA 
CTCCTGCCAA 
CTCCTGCCAA 
CTCCTGCCAA 
CTCCTGCCAA 
CTCCTGCCAA 
CTCCTGGAAA 
CTCCTGGAAA 
CTCCTGGAAA 
CTCCTGGAAA 



CTGTTCCTGC 
CGGTACCTGC 

CGGTACCGGC 
CTGTACCTGC 
CAGTGCCGGC 
CCGTTCCCGC 
CTGTTCCCGC 



GAATCCTCCG 
GAATCCTTCG 
7VAATCCTCCG 
GGATCCTCCG 
AGATCCTCCT 
TAATCCTCCG 
GGATCCTCCA 



GCGGAGTTTT 
ACCACCTTCA 
ACCACTTTCA 
ACCACCTTCA 
CTTACCTTCA 
GAGGTGTTTA 
ACTACCTTCA 



CAGCTACA7y\ 
GTGCGGCAAA 
GCCCGGCCAA 
ACCAGTCAAA 
ATCAAGCCAA 
CTCCTGCCAA 
GTCAAGCTZU\ 
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Fig. lAAAH 

4251 ^300 

42_2 GTTTGCCTCA TTTATCACGC AGTACAGCAC CGGCCA.GGT CAGCGTGGAG 

42~8 GCTGGCGTCG TTCATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 

42_15 GCTGGCGTCG TTCATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 

42~5b GCTGGCGTCG TTCATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 

42 lb GCTGGCGTCG TTCATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 

42~13 GCTGGCGTCG TTCATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 

4 2_3a GCTGGCGTCG TTCATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 

42_4 GCCGGCGTCG TTCATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 

42_5a GCTGGCGTCG TTCATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 

42_10 GTTTGCCTCA TTTATCACGC AGTACAGCAC CGGCCA.GGT CAGCGTGGAG 

42~3b GTTTGCCTCA TTTATCACGC AGTACAGCAC CGGCCA.GGT CAGCGTGGAG 

42__11 GTTTGCCTCA TTTATCACGC AGTACAGCAC CGGCCA.GGT CAGCGTGGAG 

42_6b GTTTGCCTCA TTTATCACGC AGTACAGCAC CGGCCA.GGT CAGCGTGGAG 

43_1 GCTGGCTTCT TTTATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 

4 3~5 GCTGGCTTCT TTTATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 

4 3 12 GCTGGCTTCT TTTATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 

4 3~20 GCTGAACTCT TTCATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 

43J21 GCTGAACTCT TTCATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 

4 3~23 GCTGAACTCT TTCATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 

4 3_25 GCTGAACTCT TTCATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 

4 4_1 GCTGGCGTCG TTCATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 

4 4_5 GCTGGCGTCG TTCATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 

22 3_10 GTTTGCTTCC TTCATCACGC AGTACAGCAC CGGGCA.AGT CAGCGTTGAG 

22 3 2 GTTTGCTTCC TTCATCACGC AGTACAGCAC CGGGCA.AGT CAGCGTTGAG 

223~4 GTTTGCTTCC TTCATCACGC AGTACAGCAC CGGGCA.AGT CAGCGTTGAG 

223~5 GTTTGCTTCC TTCATCACGC AGTACAGCAC CGGGCA.AGT CAGCGTTGAG 

22 3~6 GCTTGCTTCC TTCATCACGC AGTACAGCAC CGGGCA.AGT CAGCGTTGAG 

22 3_7 GATTGCTTCC TTCATCACGC AGTACAGCAC CGGGCA.AGT CAGCGTTGAG 

A3_4 GTTTGCTTCC TTCATTACCC AGTATTCCAC CGGACA.GGT CAGCGTGGAA 

A3~5 GTTTGCTTCC TTCATTACCC AGTATTCCAC CGGACA.GGT CAGCGTGGAA 

A3_7 GTTTGCTTCC TTCATTACCC AGTATTCCAC CGGACA.GGT CAGCGTGGAA 

A3_3 GTTTGCTTCC TTCATTACCC AGTATTCCAC CGGACA.GGT CAGCGTGGAA 

4212 

AAVl GTTTGCTTCA TTCATCACGC AATACTCCAC AGGACA.AGT GAGTGTGGAA 

AAV2 GTTTGCTTCC TTCATCACAC AGTACTCCAC GGGACACGGT CAGCGTGGAG 

AAV3 GTTTGCTTCA TTTATCACTC AGTACTCCAC TGGACA.GGT CAGCGTGGAA 

AAV8 GCTGAACTCT TTCATCACGC AATACAGCAC CGGACA.GGT CAGCGTGGAA 

AAV9 GCTGAACTCT TTCATCACGC AGTACAGCAC GGGACA.AGT CAGCGTGGAA 

AAV7 GTTTGCTTCC TTCATCACAC AGTACAGCAC GGGACA.AGT CAGCGTGGAA 

44 2 GCTGGCGTCG TTCATCACGC AGTACAGCAC CGGACA.GGT CAGCGTGGAA 
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Fig. lAAAI 



4301 4350 

42_2 ATCGAGTGGG AACTGCAGAA AGPJKAACAGC AAACGCTGGA ATCCAGAGAT 

42~8 ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

42_15 ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

4 2__5b ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

4 2~lb ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

A2Z^3 ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

42~3a ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

42_4 ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

4 2__5a ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

42^10 ATCGAGTGGG AACTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

42_3b ATCGAGTGGG AACTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

42_11 ATCGAGTGGG AACTGCAGAA AGAGAACAGC AAACGCTGGA ATCCAGAGAT 

4 2~6b ATCGAGTGGG AACTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

43_1 ATCGAATGGG AGCTGCAGAA AGA7U\ACAGC AAGCGCTGGA ACCCAGAGAT 

4 3~5 ATCGAATGGG AGCTGCAGAA AGAAAACAGC AAGCGCTGGA ACCCAGAGAT 

4 3_12 ATCGAATGGG AGCTGCAGAA AGAAAACAGC AAGCGCTGGA ACCCAGAGAT 

4 3~20 ATCGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

4 3~21 ATCGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

4 3_23 ATCGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

4 3^2 5 ATCGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

44__1 ATTGAATGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCAGAGAT 

44"5 ATTGAATGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCAGAGAT 

223_T0 ATCGAGTGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

223_2 ATCGAGTGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

223__4 ATCGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

223_5 ATCGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

223_6 ATCGAGTGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

22 3~7 ATCGAGTGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

A3_4 ATAGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCAGAAAT 

A3_5 ATAGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCGGAAAT 

A3~7 ATAGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCAGAAAT 

A3_3 ATAGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCAGAAAT 

42 12 



AAVl ATTGAATGGG AGCTGCAGAA 

AAV2 ATCGAGTGGG AGCTGCAGAA 

AAV3 ATTGAGTGGG AGCTACAGAA 

AAV8 ATTGAATGGG AGCTGCAGAA 

AAV9 ATCGAGTGGG AGCTGCAGAA 

AAV7 ATCGAGTGGG AGCTGCAGAA 

44 2 ATTGAATGGG AGCTGCAGAA 



AGAAAACAGC AAGCGCTGGA ATCCCGAAGT 
GGAAAACAGC AAACGCTGGA ATCCCGAAAT 
AGAAAACAGC AAACGTTGGA ATCCAGAGAT 
GGAAAACAGC AAGCGCTGGA ACCCCGAGAT 
AGAAAACAGC AAGCGCTGGA ATCCAGAGAT 
GGAAAACAGC AAGCGCTGGA ACCCGGAGAT 
AGAAAACAGC AAACGCTGGA ACCCAGAGAT 
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4351 


42 2 


TCAGTACACC 


42 8 


TCAGTATACT 


42 15 


TCAGTATACT 


42 5b 


TCAGTATACT 


42 lb 


TCAGTATACT 


42 13 


TCAGTATACT 


42 3a 


TCAGTATACT 


42 4 


TCAGTATACT 


42 5a 


TCAGTATACT 


42 10 


TCAGTACACC 


42 3b 


TCAGTACACC 


42 11 


TCAGTACACC 


4 2 6b 


TCAGTACACC 


43 1 


TCAGTATACT 


43 5 


TCAGTATACT 


43 12 


TCAGTATACT 


43 20 


TCAATACACT 


43 21 


TCAATACACT 


43 23 


TCAATACACT 


43 25 


TCAATACACT 


44 1 


TCAATACACT 


44 5 


TCAATACACT 


223 10 


TCAGTACACC 


223 2 


TCAGTACACC 


223 4 


TCAGTACACC 


223 5 


TCAGTACACC 


223 6 


TCAGTACACC 


223 7 


TCAGTACACC 


A3 4 


TCAGTACACC 


A3 5 


TCAGTACACC 


A3 7 


TCAGTACACC 


A3 3 


TCAGTACACC 


42__12 


. . . GTATACT 


AAVl 


GCAGTACACA 


AAV2 


TCAGTACACT 


AAV 3 


TCAGTACACT 


AAV 8 


CCAGTACACC 


AAV9 


CCAGTATACT 


AAV7 


TCAGTACACC 


44 2 


TCAATACACT 



Fig. lAAAJ 



TCAAATTATG CCAAGTCTAA 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACZ\AATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCAAATTATG CCAAGTCTAA 
TCAAATTATG CCAAGTCTAA 
TCAAATTATG CCAAGTCTAA 
TCAAATTATG CCAAGTCTAA 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCC7\ACTACT ACAAATCTAC 
TCCAACTTTG ACAAACAGAC 
TCCAACTTTG ACAAACAGAC 
TCCAACTTTG ACAAACAGAC 
TCCAACTTTG ACAAACAGAC 
TCCAACTTTG ACAAACAGAC 
TCCAACTTTG ACAAACAGAC 
TCCAACTACA ACAAGTCGGT 
TCCTVACTACA ACAAGTCGGT 
TCCAACTACA ACAAGTCGGT 
TCCAACTACA ACAAGTCGGT 
TCCAACTACT ACAAATCTAC 
TCCAATTATG CAAAATCTGC 
TCCAACTACA ACAAGTCTGT 
TCCAACTACA ACAAGTCTGT 
TCCAACTACT ACAAATCTAC 
TCAAACTACT ACAAATCTAC 
TCCAACTTTG AAAAGCAGAC 
TCCAACTACT ACAAATCTAC 



4400 

TAAT.GTGGA ATTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
T7VAT.GTGGA ATTTGCTGTC 
TAAT.GTGGA ATTTGCTGTC 
TAAT.GTGGA ATTTGCTGTC 
TAAT.GTGGA ATTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTCGCTGTT 
AAAT.GTGGA CTTTGCTGTT 
TGGA.GTGGA CTTTGCTGTT 
TGGA.GTGGA CTTTGCTGTT 
TGGA.GTGGA CTTTGCTGTT 
TGGA.GTGGA CTTTGCTGTT 
TGGA.GTGGA CTTTGCTGTT 
TGGA.GTGGA CTTTGCTGTT 
GAAT.GTGGA GTTTACCGTG 
GAAT.GTGGA GTTTACCGTG 
GAAT.GTGGA GTTTACCGTG 
GAAT.GTGGA GTTTACCGTG 
AAAT.GTGGA CTTTGCTGTC 
CAAC.GTTGA TTTTACTGTG 
TAATCGTGGA CTT.ACCGTG 
TAAT.GTGGA CTTTACTGTA 
AAGT.GTGGA CTTTGCTGTT 
AAAT.GTGGA CTTTGCTGTC 
TGGT.GTGGA CTTTGCCGTT 
AAAT.GTGGA CTTTGCTGTT 
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4401 4450 

^2_2 AACAACGAAG GGGTTTATAC TGAGCCTCGC CCCATTGGCA CCCGTTACCT 

4 2_8 AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CCCGTTACCT 

42_15 AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CCCGTTACCT 

42_5b AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CCCGTTACCT 

42_lb AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CCCGTTACCT 

42_13 AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CCCGTTACCT 

42_3a AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CCCGTTACCT 

42_4 AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CCCGTTACCT 

42_5a AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CCCGTTACCT 

4 2_10 AACAACGAAG GGGTTTATAC TGAGCCTCGC CCCATTGGCA CCCGTTACCT 

42_3b AACAACGAAG GGGTTTATAC TGAGCCTCGC CCCATTGGCA CCCGTTACCT 

42__11 AACAACGAAG GGGTTTATAC TGAGCCTCGC CCCATTGGCA CCCGTTACCT 

4 2_6b AACAACGAAG GGGTTTATAC TGAGCCTCGC CCCATTGGCA CCCGTTACCT 

4 3_1 AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CTCGTTATCT 

4 3~5 AATACCGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CTCGTTATCT 

4 3__12 AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CTCGTTATCT 

4 3~20 AACACGGAAG GAGTTTATAG CGAGCCTCGC CCCATTGGCA CCCGTTACCT 

4 3___21 AACACGGAAG GAGTTTATAG CGAGCCTCGC CCCATTGGCA CCCGTTACCT 

4 3_23 AACACGGAAG GAGTTTATAG CGAGCCTCGC CCCATTGGCA CCCGTTACCT 

4 3^^2 5 AACACGGAGG GGGTTTATAC CGAGCCTCGC CCCATTGGCA CCCGTTACCT 

4 4__1 AACACAGATG GCACTTATTC TGAGCCTCGC CCCATTGGCA CCCGTTACCT 

4 4__5 AACACAGATG GCACTTATTC TGAGCCTCGC CCCATTGGCA CCCGTTACCT 

22 3_10 GACAGCCAGG GTGTTTACTC TGAGCCT . . . 

223__2 GACAGCCAGG GTGTTTACTC TGAGCCT... 

22 3__4 GACAGCCAGG GTGTTTACTC TGAGCCT... 

223_5 GACAGCCAGG GTGTTTACTC TGAGCCT... 

223_6 GACAGCCAGG GTGTTTACTC TGAGCCT... 

223_7 GACAGCCAGG GTGTTTACTC TGAGCCT... 

A3_4 GACGCAAACG GTGTTTATTC TGAACCCCGC 

A3_5 GACGCAAACG GTGTTTATTC TGAACCCCGC 

A3__7 GACGCAAACG GTGTTTATTC TGAACCCCGC 

A3_3 GACGCAAACG GTGTTTATTC TGAACCCCGC 

42_12 AATACTGAGG GTACTTATTC AGAGCCTCGC 

AAVl GACAACAATG GACTTTATAC TGAGCCTCGC 

AAV2 GATACTAATG GCGTGTATTC AGAGCCTCGC 

AAV3 GACACTAATG GTGTTTATAG TGAACCTCGC 

AAV8 AATACAGAAG GCGTGTACTC TGAACCCCGC 

AAV9 AATACCGAAG GTGTTTACTC TGAGCCTCGC 

AAV7 GACAGCCAGG GTGTTTACTC TGAGCCTCGC 

44 2 AACACAGATG GCACTTATTC TGAGCCTCGC 



CCTATTGGCA 

CCTATTGGCA 
CCTATTGGCA 
CCTATTGGCA 
CCCATTGGCA 
CCCATTGGCA 
CCCATTGGCA 
CCTATTGGAA 
CCCATTGGCA 
CCCATTGGTA 
CCTATTGGCA 
CCCATCGGCA 



CTCGTTACCT 
CTCGTTACCT 
CTCGTTACCT 
CTCGTTACCT 
CCCGTTACCT 
CCCGTTACCT 
CCAGATACCT 
CCCGGTATCT 
CCCGTTACCT 
CTCGTTACCT 
CTCGTTACCT 
CCCGTTACCT 
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Fig . lAAAL 

4451 4500 
VPl -3 stop Poly A s ignal 

42 2 CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 
42~8 CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGC TAATTCGTTT 

42_15 CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

42__5b CACCCGTAAC CTGTAATTGC CTGTTAATCA ATTUUVCCGGT TAATTCGTTT 

42"lb CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TGATTCGTTT 

42~13 CACCCGTAGC CTGTAATTGC CTGTTAATCA ATAAACCGGT TGATTCGTTT 

42~3a CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

42__4 CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

42 5a CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

42~10 CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

4 2~3b CACCCGT7y\C CTGTAATTGC CTGTTAATCA AT7UVACCGGT TAATTCGTTT 

42~11 CACCCGTAAC CTGTAATTAC TTGTTAATCA ATAAACCGGT TGATTCGTTT 

4 2__6b CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

43_1 CACCCGTAAT CTGTAATTGC TTGTTAATCA ATAAACCGGT 

43 5 CACCCGTAAT CTGTAATTGC TTGTTAATCA ATAAACCGGT TAATTCGTTT 
43_T2 CACCCGTAAT CTGTAATTGC TTGTTAATCA ATAAACCGGT TAATTCGTTT 
4 3_2 0 CACCCGCAAC CTGTAATTAC ATGTTAATCA ATAAACCGGT TAATTCGTTT 
4 3~21 CACCCGCAAC CTGTAATTAC ATGTTAATCA ATAAACCGGT TAATTCGTTT 
4 3™2 3 CACCCGCAAC CTGTAATTAC ATGTTAATCA ATAAACCGGT TAATTCGTTT 
4 3_2 5 CACCCGCZy\C CTGTAATTAC ATGTTAATCA ATAAACCGGT T7VATTCGTTT 

4'4_1 CACCCGTAAT CTGTAATTGC TCGTTAATCA ATAAACCGGT TGATTCGTTT 

44~5 CACCCGTAAT CTGTAATTGC TTGTTAATCA ATAAACCGGT TGATTCGTTT 

22 3 To 

223 2 

223 4 

223 5 

223 6 

223 7 

A3_4 TACCCGGAAC TTGTAATTTC CTGTTAATCA ATAAACCGAT TTATGCGTTT 

A3~5 TACCCGGAAC TTGTAATTTC CTGTTAATCA ATAAACCGAT TTATGCGTTT 

A3~7 TACCCGGAAC TTGTAATTTC CTGTTAATCA ATAAACCGAT TTATGCGTTT 

A3~3 TACCCGGAAC TTGTAATTTC CTGTTAATCA ATAAGCCGAT TTATGCGTTT 

4 2_T2 CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

AAVl TACCCGTCCC CTGTAATTAC CTGTTAATCA ATAAACCGGT TGATTCGTTT 

AAV2 GACTCGTAAT CTGTAATTGC TTGTTAATCA ATAAACCGTT TAATTCGTTT 

AAV3 CACACGAAAC TTGTGAATCC TGGTTAATCA ATAAACCGTT TAATTCGTTT 

AAV8 CACCCGTAAT CTGTAATTGC CTGTTAATCA ATAAACCGGT TGATTCGTTT 

AAV9 CACCCGTAAT TTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

AAV7 CACCCGTAAT CTGTAATTGC ATGTTT^TCA ATAAACCGGT TGATTCGTTT 

44 2 CACCCGTAAT CTGTAATTGC TTGTTAAT CA ATAAA CCGGT TGATTCGTTT 
~ vpl-3 stop PolyA signal 



404 



EP1 310 571 A2 



Fig. lAAAM 

4501 4550 



42_2 CAGTTGAACT TTGGTCTC.T GCGAAGGGCG AATTC 

42_8 CAGTTGAACT TTGGTCTC.T GCGAAGGGCG AATTC 

42_15 CAGTTGAACT TTGGTCTC.T GCGAAGGGCG AATTC 

42_5b CAGTTGAACT TTGGTCTC.T GCGAAGGGCG AATTCGTTTA AACCTGCAGG 

42~lb CAGTTGAACT TTGGTCTC AAGGGCG AATTC 

42_13 CAGTTGAACT TTGGTCTC.T GCGAAGGGCG AATTC 

42_3a CAGTTGAACT TTGGTCTC.T GCGAAGGGCG AATTC 

4 2_4 CAGTTGAACT TTGGTCTC.T GCGAAGGGCG AATTC 

42_5a CAGTTGAACT TTGGTCTC.T GCGAAGGGCG AATTC 

42_10 CAGTTGAACT TTGGTC AAGGGCG AATTC 

42_3b CAGTTGAACT TTGGTCTC.T GCGAAGGGCG AATTC 

42_11 CAGTTGZy\CT TTGGTCTC.T GCGAAGGGCG AATTC 

42_6b CAGTTGAACT TTGGTCTC.T GCGAAGGGCG AATTC 

43 1 

4 3__5 CAGTTGAACT TTGGTCTC.T GCG7VAGGGCG AATTCGTTTA AACCTGCAGG 

43_T2 CAGTTGAACT TTGGTCTC.T GCGAAGGGCG AATTC 

43_20 CAGTTG7y\CT TTGGTCTC.T GCGAAGGGCG AATTC 

43 21 CAGTTGAACT TTGGTCTC.T GCGAAGGGCG AATTC 

4 3~23 CAGTTGAACT TTGGTCTC.T GCGAAGGGCG AATTC 

4 3__25 CAGTTGAACT TTGGTCTC.T GCGAAGGGCG AATTC 

4 4_1 CAGTTGAACT TTGGTCTC.T GCGAAGGGCG AATTC 

44_5 CAGTTGAACT TTGGTCTC.T GCGAAGGGCG AATTC 

223_l0 

22 3 2 

223 4 

223 5 

223 6 



A3 4 


CAGTTGAACT 


TTGGTCTC 


.T 


A3 5 


CAGTTGAACT 


TTGGTCTC 


.T 


A3 7 


CAGTTGAACT 


TTGGTCTC 


.T 


A3 3 


CAGTTGAACT 


TTGGTCTC 


.T 


42_12 


CAGTTGAACT 


TTGGTCTC 


.T 


AAVl 


CAGTTGAACT 


TTGGTCTCCT 


AAV2 


CAGTTGAACT 


TTGGTCTC 


.T 


AAV3 


CAGTTGAACT 


TTGGCTCT 


.T 


AAV 8 


CAGTTGAACT 


TTGGTCTC 


.T 


AAV 9 


CAGTTGAACT 


TTGGTCTC 


.T 


AAV 7 


CAGTTGAACT 


TTGGTCTCCT 


44 2 


CAGTTGAACT 


TTGGTCTC 


.T 



GCGAAGGGCG AATTCGC . GG CCGCTA. . . . 

GCGAAGGGCG AATTC 

GCGAAGGGCG AATTC 

GCGAAGGGCG AATTCGT.TT AAACCT . . , . 

GCGAAGGGCG AATTC 

GTCCTTCTTA TCTTATCGGT TACCATGGTT 
GCGTATTTCT ..TTCTT.AT CTAGTTTCCA 
GTGCACTTCT TTATCTTTAT CTTGTTTCCA 

GCG 

GCG 

GTGCTTCTTA TCTTATCGGT TTCCATAGCA 
GCGAAGGGCG AATTC 
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Fig, lAAAN 



4551 4600 

4 22 

42 8 

42 15 

42_5b ACTAGTCCCT TTAGTGAGGG TTAATTCTGA G 

42 lb 

42 13 

42 3a 

42 4 

42 5a 

42_10 

42 3b 

4211 

42_6b 

43 1 

4 3 5 AC 

43 12 

43 20 

43 21 

43 23 

43 25 

4 41 

44 5 

223 10 

223 2 

223 4 

223 5 

223 6 

223 7 

A3 4 

A3 5 

A3 7 

A3 3 

42 12 

AAVl ATAGCTTACA CATTAACTGC TTGGTTGCGC T 

AAV2 TGGCTAC . . . GTAGATAAGT AGC 

AAV3 TGGCTACTGC GTAGATAAGC AGCGGCCTGC GGCGCTTGCG CTTCGCGGTT 

AAV8 

AAV9 

AAV7 ACTGGTTACA CATTAACTGC TTGGGTGCGC TTCACGATAA GAACACTGAC 

44 2 
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Fig. lAAAO 



4601 



4650 



42__2 

42_8 
42_15 
4 2_5b 
4 2_lb 
4 2_13 
42_3a 
42_4 
42__5a 
42^10 
4 2_3b 
42_11 
42_6b 
43_1 
43__5 
43_12 
4 3_2 0 
43_21 
4 3^23 
43__2 5 
44_1 
44_5 
223_10 
223_2 
223_4 
2 2 3__5 
223_6 
223_7 
A3___4 
A3__5 
A3_7 
A3_3 
42__12 
AAVl 
AAV2 
AAV3 
AAV8 
AAV 9 
AAV 7 
44 2 



.CTTGGC GTAATCATGG GTCATAG. 



.... TCGCGA TAAAAGACTT ACGTCATCGG GTTACCCCTA GTGATGGAGT 
. . . .ATGGCG GGTTAATCAT TAACTACAAG GA.ACCCCTA GTGATGGAGT 
TACAACTGCT GGTTAATATT TAACTCTCGC CATACCTCTA GTGATGGAGT 



.GTCACCGC GGTACCCCTA GTGATGGAGT 
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Fig. lAAAP 



4651 4700 

42 2 

42 8 

42 15 

42 5b 

42_lb 

42 13 

42 3a 

42 4 

42 5a 

4210 

42 3b 

42 11 

42_6b 

43 1 

43 5 

4312 

43 20 

43 21 

43 23 

43 25 

44 1 

44 5 

223 10 

223 2 

223 4 

223 5 

223 6 

223 7 

A3 4 

A3 5 

A3 7 

A3 3 

42 12 

AAVl TGCCCACTCC CTCTCTGCGC GCTCGCTCGC TCGGTGGGGC CTGCGGACCA 

7VAV2 TGGCCACTCC CTCTCTGCGC GCTCGCTCGC TCACTGAGGC CGGGCGACCA 

AAV3 TGGCCACTCC CTCTATGCGC ACTCGCTCGC TCGGTGGGGC CTGGCGACCA 

AAV8 

AAV9 

AAV7 TGGCCACTCC CTCTATGCGC GCTCGCTCGC TCGGTGGGGC CTGCGGACCA 

44 2 
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Fig. lAAAQ 



4701 4750 

42 2 

4 2 8 

42 15 

42_5b 

42 lb 

42 13 

42_3a 

42 4 

42 5a 

4210 

42 3b 

4211 

42_6b 

43 1 

43 5 

43 12 

43 20 

4 3 21 

43 23 

43 25 

44 1 

44 5 

223_10 

223 2 

223 4 

223 5 

223 6 

223 7 

A3 4 

A3_5 

A3 7 

A3 3 

42 12 

AAVl AAGGTCCGCA GACGGCAGAG CTCTGCTCTG CCGGCCCCAC CGAGCGAGCG 

AAV2 AAGGTCGCCC GACGCCCGGG CTTTGCCCGG GCGGCCTCAG TGAGCGAGCG 

AAV3 AAGGTCGCCA GACGGACGTG CTTTGCACGT CCGGCCCCAC CGAGCGAGCG 

AAV8 

AAV9 

AAV7 AAGGTCCGCA GACGGCAGAG CTCTGCTCTG CCGGCCCCAC CGAGCGAGCG 

44 2 
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Fig. lAAAR 



42_2 

42_8 

4215 

42_5b 

42__lb 

42__13 

42__3a 

42_4 

42_5a 

42_10 

42_3b 

42_11 

42_6b 

431 

43 5 

43_12 

43_20 

43_21 

43_23 

43_25 

4 4__1 

44 5 

223_10 

223__2 

223_4 

223_5 

223__6 

223_7 

A3 4 

A3 5 

A3 7 

A3 3 

42_12 

AAVl AGCGCGCAGA GAGGGAGTGG GCAA 

AAV2 AGCGCGCAGA GAGGGAGTGG CCAA 

AAVS AGTGCGCATA GAGGGAGTGG CCAA 

AAV8 

AAV9 

AAV7 AGCGCGCATA GAGGGAGTGG CCAA 

44 2 
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Fig. 2A 
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Pig. 2B 
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Fig. 2C 
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Fig. 2D 
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Fig. 2E 
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Fig. 2F 
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Fig. 3A 



Met Pro Gly Phe Tyr Glu lie Val lie Lys Val Pro Ser Asp Leu Asp 
1 5' 10 15 



Glu His Leu Pro Gly lie Ser Asp Ser Phe Val Asn Trp Val Ala Glu 
20 25 30 



Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu lie 
35 40 45 



Glu Gin Ala Pro Leu Thr Val Ala Glu Lys Leu Gin Arg Asp Phe Leu 
50 55 60 



Val Gin Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 70 75 80 



Gin Phe Glu Lys Gly Glu Ser Tyr Phe His Leu His Val Leu Val Glu 
85 90 95 



Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gin He 
100 105 110 



Arg Glu Lys Leu Val Gin Thr He Tyr Arg Gly Val Glu Pro Thr Leu 
115 120 125 



Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 
130 135 140 



Asn Lys Val Val Asp Glu Cys Tyr He Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 



Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Glu Glu Tyr He 
165 170 175 



Ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu Val Ala Gin His 
180 185 190 



Leu Thr His Val Ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Leu Asn 
195 200 205 



Pro Asn Ser Asp Ala Pro Val He Arg Ser Lys Thr Ser Ala Arg Tyr 
210 215 220 
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Fig. 3B 



Met Glu Leu Val Gly Trp Leu Vai Asp Arg Giy lie Thr Ser Giu Lys 
225 230 235 240 

Gin Trp lie Gin Glu Asp Gin Ala Ser Tyr He Ser Phe Asn Ala Ala 
245 250 255 



Ser Asn Ser Arg Ser Gin He Lys Ala Ala Leu Asp Asn Ala Gly Lys 
260 265 270 



lie Met Ala Leu Thr Lys Ser Ala Pro Asp Tyr Leu Val Gly Pro Ser 
275 280 285 



Leu Pro Ala Asp He Lys Thr Asn Arg He Tyr Arg He Leu Glu Leu 
290 295 300 



Asn Gly Tyr Asp Pro Ala Tyr Ala Gly Ser Val Phe Leu Gly Trp Ala 
305 310 315 320 



Gin Lys Lys Phe Gly Lys Arg Asn Thr He Trp Leu Phe Gly Pro Ala 

325 330 335 



Thr Thr Gly Lys Thr Asn He Ala Glu Ala He Ala His Ala Val Pro 
340 345 350 



Phe Tyr Giy Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 
355 360 365 



Cys Val Asp Lys Met Val He Trp Trp Glu Glu Gly Lys Met Thr Ala 
370 375 380 



Lys Vai Val Glu Ser Ala Lys Ala He Leu Giy Gly Ser Lys Vai Arg 
385 390 395 400 



Val Asp Gin Lys Cys Lys Ser Ser Ala Gin He Asp Pro Thr Pro Val 
405 410 415 



lie Val Thr Ser Asn Thr Asn Met Cys Ala Vai He Asp Giy Asn Ser 
420 425 430 



Thr Thr Phe Giu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 
435 440 445 
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Fig. 3C 



Glu Leu Thr Arg Arg Leu Glu His Asp Phe Gly Lys Val Thr Lys Gin 
450 455 460 

Glu Val Lys Glu Phe Phe Arg Trp Ala Ser Asp His Val Thr Glu Val 
465 470 475 480 

Ala His Glu Phe Tyr Val Arg Lys Gly Gly Ala Ser Lys Arg Pro Ala 
485 490 495 

Pro ASP Asp Ala Asp He Ser Glu Pro Lys Arg Ala Cys Pro Ser Val 
500 505 510 

Ala Asp Pro Ser Thr Ser Asp Ala Glu Gly Ala Pro Val Asp Phe Ala 
515 520 525 

ASP Arg Tyr Gin Asn Lys Cys Ser Arg His Ala Gly Met He Gin Met 
530 535 540 



Leu 



Phe Pro Cys Lys Thr Cys Glu Arg Met Asn Gin Asn Phe Asn He 



545 550 



555 560 



Cys Phe Thr His Gly Val Arg Asp Cys Leu Glu Cys Phe Pro Gly Val 
565 570 575 



Ser Glu Ser Gin Pro Val Val Arg Lys Lys Thr Tyr Arg Lys Leu Cys 
580 585 590 

Ala He His His Leu Leu Gly Arg Ala Pro Glu He Ala Cys Ser Ala 
595 600 605 

Cys Asp Leu Val Asn Val Asp Leu Asp Asp Cys Val Ser Glu Gin 
610 615 620 
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